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Title: US-09-830-972-32 
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Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1586107 seqs, 282547505 residues 
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Minimum DB seq length: 0 
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Pred. No, is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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RESULT 1 
AAY71312 

ID AAY71312 standard; protein; 522 AA. 
XX 

AC AAY71312; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat neurite growth inhibitor Nogo C. 
XX 

KW Rat; neurite growth inhibitor; Nogo C; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperproliferative disorder; benign dysprolif erative disorder; diagnosis 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 



KW structural plasticity; screening. 
XX 

OS Rattus sp. 
XX 

FH Key Location/Qualifiers 

FT Region 1- -39 

prp /note= "Sequence upstream to the N-terminus of Nogo C 

FT protein" 

FT Misc-dif f erence 3 

FT /note= "Encoded by TAG" 

FT Region 11. .191 

FT /note= "Region specifically described in claim 16" 

FT Misc-dif ference 29 

FT /note= "Encoded by TAA" 

FT Protein 40. .238 

FT /label= Nogo_C_protein 

FT Region 51. .238 

FX /note=^ "C-terminal common region found in Nogo B and C 

FT isof orms " 

FT Region 239. .522 

FT /note= "Sequence downstream to the C-terminus of Nogo C 

FT protein" 

FT Misc-dif ference 239 

FT /note= "Encoded by TGA" 

FT Misc-dif ference 263 

FT /note= "Encoded by TGA" 

FT Misc-dif ference 276 

FT /note= "Encoded by TAG" 

FT Misc-dif ference 281 

FT /note= "Encoded by TGA" 

FT Misc-dif ference 295 

FT /note= "Encoded by TAA" 

FT Misc-dif ference 298 

FT /note= "Encoded by tAa" 

FT Misc-dif ference 314 

FT /note= "Encoded by TGA" 

FT Misc-dif ference 318 

FT /note= "Encoded by TGA" 

FT Misc-dif ference 335 

FT /note= "Encoded by TAG" 

FT Misc-dif ference 371 

FT /note= "Encoded by TGA" 

FT Misc-dif ference 374 

FT /note= "Encoded by TAG" 

FT Misc-dif ference 380 

FT /note= "Encoded by TAA" 

FT Misc-dif ference 406 

FT /note= "Encoded by TAA" 

FT Misc-dif ference 408 

FT /note= "Encoded by TAG" 

FT Misc-dif ference 410 

FT /note= "Encoded by TAA" 

FT Misc-dif ference 422 

FT /note= "Encoded by TAA" 

FT Misc-dif ference 433 

FT /note= "Encoded by TGA" 

FT Misc-dif ference 440 



FT /note= "Encoded by TAG" 

FT Misc-dif f erence 453 

FT /note= "Encoded by TAG" 

FT Misc-dif f erence 465 

FT /note= "Encoded by TAG" 

FT Misc-dif f erence 482 

FT /note- "Encoded by TAG" 

FT Misc-dif f erence 513 

FT /note= "Encoded by TAA" 
XX 

PN WO200031235-A2 . 
XX 

PD 02-JUN-2000, 
XX 

PF 05-NOV-1999; 99WO-US026160 . 
XX 

PR 06-NOV-1998; 98US-0107446P . 
XX 

PA (SCHW/) SCHWAB M E, 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 

DR N-PSDB; AAD01175. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Claim 7; Fig 14; 122pp; English. 
XX 

CC The present sequence is a rat Nogo C protein which is a potent neural 

CC cell growth inhibitor and is free of all central nervous system (CNS) 

CC myelin material with which it is natively associated. Nogo proteins and 

CC fragments displaying neurite growth inhibitory activity are used in the 

CC treatment of neoplastic disease of the CNS e.g. glioma, glioblastoma, 

CC medulloblastoma, craniopharyngioma, ependyoma, pinealoma, 

CC haemangioblastoma, acoustic neuroma, oligodendroglioma, menagioma, 

CC neuroblastoma or retinoblastoma and degenerative nerve diseases e.g. 

CC Alzheimer *s and Parkinson's diseases. Therapeutics which promote Nogo 

CC activity can be used to treat or prevent hyperprolif erative or benign 

CC dysprolif erative disorders e.g. psoriasis and tissue hypertrophy. 

CC Ribozymes or antisense Nogo nucleic acids can be used to inhibit 

CC production of Nogo protein to induce regeneration of neurons or to 

CC promote structural plasticity of the CNS in disorders where neurite 

CC growth, regeneration or maintenance are deficient or desired. The animal 

CC models can be used in diagnostic and screening methods for predisposition 

CC to disorders and to screen for or test molecules which can treat or 

CC prevent disorders or diseases of the CNS. Note: SEQ ID numbers 35-42 are 

CC referred in claim 32 and SEQ ID NO: 29 in disclosure of the 

CC specification. However the specification does not include sequences for 

CC these SEQ ID numbers 

XX 

SQ Sequence 522 AA; 



Query Match 96.5%; Score 68 0.5; DB 3; Length 522; 

Best Local Similarity 98.6%; Pred. No. 2.5e-71; 



Matches 138; Conservative 0; Mismatches 1; Indels 1; Gaps 1 
Qy 3 SGEAGVSCLRENFAVYSVSVGMHNL-LLLEGRSWQEMDGQKKHWKDKWDLLYWRDIKKT 61 

I I I I I M I I I I I I I I M I I I I I I I IIIIIIIIIIMIIIIIIIIIIMIIII 

Db 4 SGEAGVSCLRENFAVYSVSVGMHNLXLLLEGRSWQEMDGQKKHWKDKWDLLYWRDIKKT 63 

Qy 62 GWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIAKSDEGHPFRAYL 121 

I I I I I I I I I I M I Ill II I II I I II I MM II 

Db 64 GWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYL 123 

Qy 122 ESEVAISEELVQKYSNSALG 141 

II I I M I I I M I M II I M I 

Db 124 ESEVAISEELVQKYSNSALG 143 



RESULT 2 
ABB81077 

ID ABB81077 standard; protein; 199 AA. 
XX 

AC 7VBB81077; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Rat neurotransmitter receptor protein Nogo-C. 



XX 
KW 
KW 
KW 



Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 
central nervous system; peripheral nervous system; tranquillizer; Nogo; 
vulnerary ; cerebroprotective ; anti- tumour ; antidiabetic ; anticonvulsant ; 

KW nootropic; antiparkinsonian; ophthalmological ; analgesic; hepatotropic; 

KW osteopathic; vasotropic; nephrotropic; cytostatic; antigen; gene therapy; 

KW neurotransmitter receptor; rat; receptor, 
XX 

OS Rattus norvegicus. 
XX 

FN US2002072493-A1. 
XX 

PD 13-JUN-2002 . 
XX 

PF 28-JUN-2001; 2001US-008 9334 8 . 
XX 

PR 19-MAY-199B; 98IL-00124500 . 

PR 21-JUL-1998; 98WO-US014715 . 

PR 22-DEC-1998; 98US-00218277 . 

PR 19-MAY-1999; 99US-00314161 . 
XX 

PA (YEDA ) YEDA RES & DEV CO LTD. 
XX 

PI Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR N-PSDB; ABN86600. 

XX ■ . u 

PT Promoting nerve regeneration and preventing neuronal degeneration m the 

PT central/peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides . 



XX 

PS Example 5; Page 48-49; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central/peripheral nervous system (NS) . The method involves administering 

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central/peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease is 

CC diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzfeldt-Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma- 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates ) , Charcot-Marie-Tooth disease, ataxia 

CC telangectasia, Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy. Giant axonal neuropathy, Ref sum's disease, Fabry's 

CC disease, or lipoproteinemia . The present sequence represents the rat 

CC neurotransmitter receptor protein Nogo-C, an example of NS-specific 

CC antigen 

XX 

SQ Sequence 199 AA; 

Query Match 72.3%; Score 510; DB 5; Length 199; 

Best Local Similarity 99.0%; Pred. No. 9.6e-52; 

103; Conservative 0; Mismatches 1; Indels 0; Gaps 0 

38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSA/TAYIALALLSVTIS 97 

I M I I I I I M I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I M I M I I M I I I I I I I M 

1 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 

98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



Matches 

Qy 

Db 

Qy 

Db 



RESULT 3 
ABB89192 

ID ABB89192 standard; protein; 118 7\A. 
XX 

AC ABB89192; 
XX 

DT 24-MAY-2002 (first entry) 
XX 

DE Human polypeptide SEQ ID NO 1568. 



XX 

KW Cytostatic; immunosuppressive; nootropic; neuroprotective; antiviral; 

KW antiallergic; hepatotropic; antidiabetic; antiinflammatory; antiulcer; 

KW vulnerary; anticonvulsant; antibacterial; antifungal; antiparasitic; 

KW cardiant; gene therapy; cancer; immune disorder; cardiovascular disorder; 

KW neurological disease; infection; human; secreted protein. 

XX 

OS Homo sapiens . 
XX 

PN WO200190304-A2. 
XX 

PD 29-NOV-2001. 
XX 

PF 18-MAY-2001; 2001WO-US016450 . 
XX 

PR 19-MAY-2000; 2000US-0205515P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Birse CE, Rosen CA; 
XX 

DR WPI; 2002-122018/16, 

DR N-PSDB; ABL89601. 
XX 

PT Novel 1405 isolated polypeptides, useful for diagnosis, treatment and 

PT prevention of neural, immune system, muscular, reproductive, 

PT gastrointestinal, pulmonary, cardiovascular, renal and proliferative 

PT disorders. 

XX 

PS Claim 11; SEQ ID NO 1568; 2081pp + Sequence Listing; English. 
XX 

CC The invention relates to novel genes (ABL89449-ABL90853 ) and proteins 

CC (ABB89040-ABB90444) useful for preventing, treating or ameliorating 

CC medical conditions e.g. by protein or gene therapy. The genes are 

CC isolated from a range of human tissues disclosed in the specification, 

CC The nucleic acids, proteins, antibodies and ( ant ) agonists are useful in 

CC the diagnosis, treatment and prevention of: (a) cancer, e,g, breast and 

CC ovarian cancer and other cancers of the adrenal gland, bone, bone marrow, 

CC breast, gastrointestinal tract, liver, lung, or urogenital; (b) immune 

CC disorders e.g. Addison's disease, allergies, autoimmune haemolytic 

CC anaemia, autoimmune thyroiditis, diabetes mellitus, Crohn's disease, 

CC multiple sclerosis, rheumatoid arthritis and ulcerative colitis; (c) 

CC cardiovascular disorders such as myocardial ischaemias; (d) wound healing 

CC ; (e) neurological diseases e.g. cerebral anoxia and epilepsy; and (f) 

CC infectious diseases such as viral, bacterial, fungal and parasitic 

CC infections. Note: The sequence data for this patent did not form part of 

CC the printed specification, but was obtained in electronic format directly 

CC from WIPO at f tp , wipo , int/pub/published_pct_sequences 

XX 

SQ Sequence 118 AA; 

Query Match 71.3%; Score 503; DB 5; Length 118; 

Best Local Similarity 98.1%; Pred. No. 3.2e-51; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps 0 



QY 



38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 
I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I 



Db 



1 MDGQKKNWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 



Qy 98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNS7U.G 141 

I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



RESULT 4 
AAW53947 

ID AAW53947 standard; protein; 199 AA. 
XX 

AC AAW53947; 
XX 

DT 24-JUL-1998 (first entry) 
XX 

DE Human NSPLP protein A. 
XX 

KW NSPLP; neuroendocrine-specif ic protein-like protein; human; gene therapy; 

KW neurodegenerative disease; amyotrophic lateral sclerosis; cancer, 

XX 

OS Homo sapiens. 
XX 

PN WO9806841-7V2. 
XX 

PD 19-FEB-1998. 
XX 

PF 24-JUL-1997; 97WO-US013469 . 
XX 

PR 12-AUG-1996; 96US-00700607 . 
XX 

PA (INCY-) INCYTE PHARM INC. 
XX 

PI Bandman O, Au-Young J, Goli SK, Hillman J; 
XX 

DR WPI; 1998-159533/14, 

DR N-PSDB; AAV23695. 
XX 

PT Human neuro-endocrine-specif ic protein-like proteins - useful for 

PT diagnosis, monitoring and treatment of cancer and neuro-degenerative 

PT disease, 
XX 

PS Claim 1; Page 38; 73pp; English, 
XX 

CC This sequence is a human neuroendocrine-specif ic protein-like protein 

CC (NSPLP) of the invention. Recombinant cells transformed with the DNA are 

CC used to express the NSPLP proteins, which are used to treat cancer and 

CC neurodegenerative diseases such as amyotrophic lateral sclerosis. Also 

CC antisense nucleic acids and antagonists of NSPLP can be used to inhibit 

CC activity of the NSPLP proteins. Antibodies specific for NSPLP are used 

CC for diagnosis and monitoring treatment of diseases associated with NSPLP 

CC expression, in usual immunoassays, and to isolate NSPLP from natural 

CC sources. The NSPLP proteins, or their fragments can also be used in drug 

CC screening to identify NSPLP antagonists. The nucleic acid can be used 

CC diagnostically and for monitoring treatment (in hybridisation or 

CC amplification assays); to isolate closely related sequences; in gene 

CC therapy for both sense and antisense applications (including use of 

CC ribozymes) and for mapping the natural genomic sequence 



XX 

SQ Sequence 199 AA; 



Query Match 71.3%; Score 503; DB 2; Length 199; 

Best Local Similarity 98.1%; Pred. No. 6.4e-51; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps C 

Qy 38 MDGQKKHWKDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 

I I I I M : M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I 
Db 1 MDGQKKNWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 

Qy 98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



AAW78313; 

13-APR-1999 (first entry) 

Fragment of human secreted protein encoded by gene 69. 

Human; secreted protein; fusion protein; gene therapy; protein therapy; 
diagnosis; tissue; cancer; tumour; neurodegenerative disorder; leukaemia; 
developmental abnormality; foetal deficiency; blood; allergy; renal; 
immune system; asthma; lymphocytic disease; brain; hepatic; lymphoma; 
inflammation; ischaemic shock; Alzheimer's disease; restenosis; AIDS; 
cognitive disorder; schizophrenia; prostate; obesity; osteoclast; thymus; 
osteoporosis; arthritis; testis; lung; thyroiditis; thyroid; digestion; 
endocrine; metabolism; regulation; malabsorption; gastritis; neoplasm. 



RESULT 5 
AAW78313 

ID AAW78313 standard; protein; 199 AA. 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
KW 
KW 
KW 
KW 
KW 
XX 
OS 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 



Homo sapiens. 




W098568 


04-Al. 




17- 


DEC- 


1998. 






11- 


JUN- 


1998; 


98WO- 


-US012125. 


13- 


JUN- 


1997, 


97US 


-0049547P. 


13- 


JUN- 


1997, 


97US 


-0049548P. 


13- 


JUN- 


1997, 


97US 


-0049549P. 


13- 


JUN- 


1997, 


97US 


-0049550P. 


13- 


JUN- 


1997, 


97US 


-0049566P. 


13- 


JUN- 


1997, 


97US 


-0049606P, 


13- 


JUN- 


1997, 


97US 


-0049607P. 


13- 


JUN- 


1997, 


97US 


-0049608P. 


13- 


JUN- 


1997 


■r 97US 


-0049609P, 


13- 


JUN- 


1997 


; 97US 


-0049610P. 


13- 


JUN- 


1997 


r 97US 


-0049611P. 


13- 


JUN- 


1997 


; 97US 


-0050901P. 


13- 


-JUN- 


1997 


; 97US 


-0052989P. 


08- 


■JUL- 


-1997 


; 97US 


-0051919P. 



PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
XX 
PA 
XX 
PI 
PI 
PI 
XX 
DR 
DR 
XX 
PT 
PT 
PT 
XX 
PS 
XX 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

XX 
SQ 



18-AUG- 
12-SEP- 
12-SEP- 
12-SEP- 
12-SEP- 
12-SEP- 
12-SEP- 
12-SEP- 
02-OCT- 
02-OCT- 
02-OCT- 
02-OCT 
02-OCT- 
02-OCT 



-1997 
•1997 
-1997 
-1997 
-1997 
-1997 
-1997 
-1997 
-1997 
-1997 
-1997 
-1997 
-1997 
-1997 



97US- 
97US- 
97US- 
97US- 
97US- 
97US- 
97US- 
97US- 
97US- 
97US- 
97US- 
97US- 
97US- 
97US- 



0055984P. 
0058665P. 
0058668P. 
0058669P, 
0058750P. 
0058971P. 
0058972P. 
0058975P. 
0060834P. 
•0060841P. 
■0060844P. 
■0060865P. 
■0061059P. 
•0061060P. 



(HUMA-) HUMAN. GENOME SCI INC. 

Moore PA, Shi Y, Rosen CA, Ruben SM, Lafleur DW, Olsen HS; 
Ebner R, Brewer LA, Young P, Greene JM, Ferrie AM, Yu G, Ni J.- 
Feng P; 

WPI; 1999-080881/07. 
N-PSDB; AAX04379. 

New isolated human genes and the secreted polypeptides they encode - 
useful for diagnosis and treatment of e.g. cancers, neurological 
disorders, immune diseases, inflammation or blood disorders. 

Disclosure; Page 62; 380pp; English. 

This sequence represents a fragment of a secreted human protein encoded 
by the nucleic acid molecule detailed in the descriptor line. The gene 
can be used to generate fusion proteins by linking to the gene to a human 
immunoglobulin Fc portion (e.g. AAX04302) for increasing the stability of 
the fused protein as compared to the human protein only. The invention 
relates to 86 novel genes and their fragments (nucleic acid sequences: 
AAX04311-X04410; amino acid sequences AAW7 8126-W7 8225 ) which are useful 
for preventing, treating or ameliorating medical conditions e.g. by 
protein or gene therapy. Also, pathological conditions can be diagnosed 
by determining the amount of the new polypeptides in a sample or by 
determining the presence of mutations in the new polynucleotides. 
Specific uses are described for each of the 86 polynucleotides, based on 
which tissues they are most highly expressed in (see AAX04311 for 
described uses) 



Sequence 199 AA; 

Query Match 71.3%; 
Best Local Similarity 98.1%; 
Matches 102; Conservative 



Score 503; DB 2; Length 199; 
Pred. No. 6.4e-51; 
1; Mismatches 1; Indels 



0; Gaps 



Qy 

Db 



38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 

M I I I I : I I I I I I M I I I I M M I I I I M I I I I I I M I I M I M I I I I I I M I I I I M I I 

1 MDGQKKNWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 



97 



60 



98 FRIYKGVIQAI7VKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 
I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 



61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



RESULT 6 
AAY35903 

ID AAY35903 standard; protein; 199 AA. 
XX 

AC AAY35903; 
XX 

DT 13-SEP-1999 (first entry) 
XX 

DE Extended human secreted protein sequence^ SEQ ID NO. 152, 
XX 

KW Secreted protein; human; cytokine; cellular proliferation; cell movement; 

KW cellular differentiation; immune system regulator; anti-inflammatory; 

KW haematopoiesis regulator; tissue growth regulator; tumour inhibitor; 

KW reproductive hormone regulator; chemotaxis; chemokinesis ; gene therapy; 

KW genetic disease. 
XX 

OS Homo sapiens. 
XX 

PN W09931236-A2. 
XX 

PD 24-JUN-1999. 
XX 

PF 17-DEC-1998; 98WO-IB002 122 . 
XX 

PR 17-DEC-1997; 97US-0069957P . 

PR 09-FEB-1998; 98US-0074 12 IP . 

PR 13-APR-1998; 98US-0081563P . 

PR lO-AUG-1998; 9 8US-0096116P . 
XX 

PA (GEST ) GENSET. 
XX 

PI Bougueleret L, Duclert A, Dumas Milne Edwards J; 
XX 

DR WPI; 1999-385906/32. 

DR N-PSDB; AAX97587. 
XX 

PT New isolated human secreted proteins. 
XX 

PS Claim 9; Page 185-186; 516pp; English, 
XX 

CC This sequence is encoded by an extended human secreted protein coding 

CC sequence of the invention. The secreted proteins can be used in treating 

CC or controlling a variety of human conditions. The secreted proteins may 

CC act as cytokines or may affect cellular proliferation or differentiation 

CC or may act as immune system regulators^ haematopoiesis regulators, tissue 

CC growth regulators, regulators of reproductive hormones or cell movement 

CC or have chemotactic/chemokinetic, receptor/ligand, anti-inflammatory or 

CC tumour inhibition activity. The DNAs can be used in forensic procedures 

CC to identify individuals or in diagnostic procedures to identify 

CC individuals having genetic diseases resulting from abnormal expression of 

CC the genes corresponding to the extended cDNAs . They are also useful for 

CC constructing a high resolution map of the human chromosomes. They can 

CC also be used for gene therapy to control or treat genetic diseases 
XX 



SQ Sequence 199 AA; 



Query Match 71.3%; Score 503; DB 2; Length 199; 

Best Local Similarity 98.1%; Pred. No. 6.4e-51; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps 0 

Qy 38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 

I I I I I I : I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I 
Db 1 MDGQKKNWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSWAYIALALLSVTIS 60 

Qy 98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNS7VLG 141 

I I I I I I I M I I I I I I M I M I I I I I I I I I I M I I I I I I I I I I I 
Db 61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



RESULT 7 




AAB12 


! 805 




ID 


AABlzoOo standard; protein; lyy AA. 




XX 






AC 


AAB12 8 05; 




XX 






DT 


24-NOV-zOOO (rirst entry) 




XX 






DE 


Human NSPH protein sequence SEQ ID NO : 4 . 




AA 






KW 


Human; neuroendocrine-specif ic protein; NSPH; 


NSPA; NSPB; NSPC. 


XX 






OS 


Homo sapiens . 




XX 






PN 


CN1253180-A. 




XX 






PD 


17-MAY-2000. 




V V 
AA 






PF 


30-OCT-1998; 98CN-0012147 3 . 




XX 






PR 


30-OCT-19 98; 98CN-00121473 . 




XX 






PA 


(UYFU-) UNIV FUDAN. 




XX 






PI 


Yu L, Zhao Y, Zhang H; 




XX 






DR 


WPI; 2000-466537/41, 




DR 


N-PSDB; 7WV72981. 




XX 






PT 


Specific protein of human neuroendocrine, coding sequence and its 


PT 


preparating process and application. 




XX 






PS 


Claim 4; Page 14-15; 21pp; Chinese. 




XX 






CC 


The present invention relates to a new member 


of the human neuroendocrine 


CC 


specific protein family, designated NSPH. The 


present sequence represents 


CC 


the human NSPH protein 




XX 






SQ 


Sequence 199 AA; 





Query Match 71.3%; Score 503; DB 3; Length 199; 

Best Local Similarity 98.1%; Pred. No. 6.4e-51; 



Matches 



102; Conservative 1; Mismatches 1; Indels 0; Gaps 0 



Qy 



38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 




Db 



1 MDGQKKNWKDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 



Qy 



98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 




Db 



61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



RESULT 8 
AAB82348 

ID AAB82348 standard; protein; 199 AA. 
XX 

AC AAB82348; 
XX 

DT 23-JUL-2001 (first entry) 
XX 

DE Human NOGO-C protein. 
XX 

KW NOGO-C; human; chromosome 2p21; neuropathy; spinal injury; brain injury; 

KW stroke; neuronal degeneration; Alzheimer's disease; Parkinson's disease; 

PCW neuromuscular disorder; psychiatric disorder; developmental disorder; 

KW neuroprotective; nootropic; neuroleptic; antiparkinsonian; 

KW cerebroprotective; neuroleptic; diagnosis; therapy, 
XX 

OS Homo sapiens. 
XX 

PN WO200136631-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 14-NOV-2000; 2000WO-GB004345 . 
XX 

PR 15-NOV-1999; 99GB-00026995 . 

PR 24-JAN-2000; 2000GB-00001550 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha R; 
XX 

DR WPI; 2001-343822/36. 

DR N-PSDB; AAF90323. 
XX 

PT New polypeptide designated NOGO-C is a splice variant of the human NOGO 

PT gene and may be useful in the treatment of neural disorders including 

PT Alzheimer's and Parkinson's diseases, 
XX 

PS Claim 3; Page 25; 25pp; English. 
XX 

CC The present sequence is that of human NOGO-C, encoded by a novel splice 

CC variant of the human NOGO gene on chromosome 2p21. 2 Other splice 

CC variants, NOGO-A and NOGO-B, have previously been identified. The 

CC invention provides NOGO-C polypeptides and polynucleotides, and methods 

CC for producing such polypeptides by recombinant techniques. Also disclosed 

CC are methods for utilising NOGO-C polypeptides and polynucleotides in the 



cc treatment of diseases including neuropathies, spinal injury, brain 

CC injury, stroke, neuronal degeneration, for example Alzheimer's disease 

CC and Parkinson's disease, neuromuscular disorders, psychiatric disorders 

CC and developmental disorders. Also provided are methods for identifying 

CC agonists and agonists for use in treating conditions associated with NOGO 

CC -C imbalance, and diagnostic assays for detecting diseases associated 

CC with inappropriate NOGO-C activity or levels 
XX 

SQ Sequence 199 AA; 

Query Match 71.3%; Score 503; DB 4; Length 199; 

Best Local Similarity 98.1%; Pred. No. 6.4e-51; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy 38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 

M I I M : I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDGQKKNWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 

Qy 98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



RESULT 9 
ABG30939 

ID ABG30939 standard; protein; 199 AA. 
XX 

AC MG30939; 
XX 

DT 21-OCT-2002 (first entry) 
XX 

DE Human NogoC protein. 
XX 

KW Human; Nogo; BACE; acute neuronal injury; spinal injury; head injury; 

KW stroke; peripheral nerve damage; neoplastic disorder; glioblastoma; 

PCW neuroblastoma; hyperprolif erative disorder; dysprolif erative disorder; 

KW cirrhosis; psoriasis; keloid formation; fibrocystic condition; cancer; 

KW tissue hypertrophy; central nervous system; axon regeneration; NogoC; 

KW Nogo-associated disease; metastasis. 
XX 

OS Homo sapiens. 
XX 

PN WO200257483-A2. 
XX 

PD 25-JUL-2002. 
XX 

PF 18-JAN-2002; 2002WO-GB000228 . 
XX 

PR 18-JAN-2001; 2 OOlGB-0 00 01312 . 
XX 

PA (GLAX ) GLAXO GROUP LTD. 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 

XX 

PI Blackstock WP, Hale RS, Prinjha R, Rowley A; 
XX 

DR WPl; 2002-599722/64. 

DR N-PSDB; ABK90135. 



XX 

PT Identifying modulators of Nogo or BACE activity for treating acute 

PT neuronal injuries, neoplastic or dysprolif erative disorders, comprises 

PT providing and monitoring interaction between Nogo and BACE polypeptides. 

XX 

PS Disclosure; Page 64; 68pp; English. 
XX 

CC The present invention relates to a new method of identifying modulators 

CC of Nogo function or BACE activity. The method involves providing Nogo and 

CC BACE polypeptides capable of binding with each other, monitoring the 

CC interaction between these polypeptides, and determining if the test agent 

CC is a modulator of Nogo or BACE activity. The method is useful in treating 

CC acute neuronal injuries, such as spinal or head injury, stroke, 

CC peripheral nerve damage, and in neoplastic (e.g. glioblastomas, 

CC neuroblastomas), hyperprolif erative or dysprolif erative disorders (e.g. 

CC cirrhosis, psoriasis, keloid formation, fibrocystic conditions, tissue 

CC hypertrophy) of the central nervous system. The BACE polypeptide is 

CC useful in screening methods to identify agents that may act as modulators 

CC of BACE activity and in particular agents that may be useful in treating 

CC Nogo-associated diseases. The modulators of Nogo or BACE polypeptides, 

CC and the polynucleotide encoding the BACE polypeptide are useful in 

CC manufacturing a medicament for the treatment or prevention of disorders 

CC responsive to the modulation of Nogo activity, in alleviating the 

CC symptoms or improving the condition of a patient suffering from this 

CC disorder, in axon regeneration, or in preventing metastasis or spreading 

CC of a cancer. The polynucleotide may also be an essential component in 

CC assays, a probe, in recombinant protein synthesis, and in gene therapy 

CC techniques. The present amino acid sequence represents the human NogoC 

CC protein of the invention 

XX 

SQ Sequence 199 AA; 



Query Match 71.3%; Score 503; DB 5; Length 199; 

Best Local Similarity 98.1%; Pred. No. 6.4e-51; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 
Qy 38 MDGQKKHWKDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 

M M I I * I I I I I I I I I I M I I I I I M I I I I I I I I I I i I I I I I I I I I I I I I M I I I I I I I I 

Db 1 MDGQKKNWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 

Qy 98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 
Db 61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



RESULT 10 
ABB81080 

ID ABB81080 standard; protein; 199 AA. 
XX 

AC ABB81080; 
XX 

DT 05-NOV-2002 (first entry) ' 
XX 

DE Human neurotransmitter receptor protein Nogo-C. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 
KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 



KW vulnerary; cerebroprotective; anti-tumour; antidiabetic; anticonvulsant; 

KW nootropic; antiparkinsonian; ophthalmological ; analgesic; hepatotropic; 

KW osteopathic; vasotropic; nephrotropic; cytostatic; antigen; gene therapy; 

KW neurotransmitter receptor; human; receptor. 
XX 

OS Homo sapiens, 
XX 

PN US2002072493-A1. 
XX 

PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2001US-0089334 8 . 
XX 

PR 19-MAY-1998; 98IL-00124500 . 

PR 21-JUL-1998; 98WO-US014715 . 

PR 22-DEC-1998; 98US-002 18277 . 

PR 19-MAY-1999; 99US-00314 161 . 
XX 

PA (YEDA ) YEDA RES & DEV CO LTD. 
XX 

PI Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR N-PSDB; ABN86601. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in the 

PT central/peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides , 

XX 

PS Example; Page 57-58; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central/peripheral nervous system (NS). The method involves administering 

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central/peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease is 

CC diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzf eldt-Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma- 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates) , Charcot-Marie-Tooth disease, ataxia 



CC telangectasia, Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy. Giant axonal neuropathy, Ref sum's disease, Fabry's 

CC disease, or lipoproteinemia . The present sequence represents the human 

CC neurotransmitter receptor protein Nogo-C, an example of NS-specific 

CC antigen 

XX 

SQ Sequence 199 AA; 

Query Match 71.3%; Score 503; DB 5; Length 199; 

Best Local Similarity 98.1%; Pred. No. 6.4e-51; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps 0 

Qy 38 MDGQKKHWKDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I 
Db 1 MDGQKKNWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 

Qy 98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



RESULT 11 
AAY71559 

ID AAY71559 standard; protein; 199 AA. 
XX 

AC 7\AY71559; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat Nogo C/Nogo A proteins derived fragment to construct mutant Nogo-C. . 
XX 

KW Rat; neurite growth inhibitor; Nogo A; Nogo C; neural cell; myelin; CNS; 
KW central nervous system; neoplastic disease; antiproliferative; glioma; 



antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 



KW 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 
KW hyperproliferative disorder; benign dysprolif erative disorder; diagnosis; 
KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 
KW structural plasticity; screening; mutant; mutein. 
XX 

OS Rattus sp. 
XX 

FH Key Location/Qualifiers 
FT Region 1. .11 

FT /note= "Corresponds to residues 40-50 of rat Nogo C 

FT protein shown in AAY71312" 

FT Region 12. .199 

FT /note= "Corresponds to residues 975-1162 of rat Nogo A 

FT protein shown in AAY71310" 

XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US02 6160 . 
XX 

PR 06-NOV-1998; 98US-010744 6P . 
XX 



Nogo proteins and nucleic acids useful for treating neoplastic disorders 
of the central nervous system and inducing regeneration of neurons. 



PA (SCHW/) SCHWAB M E. 
PA- (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 
PT 
PT 
XX 

PS Example; Page; 122pp; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of all 

CC central nervous system (CNS) myelin material with which it is natively 

CC associated. Nogo proteins and fragments displaying neurite growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblas toma, acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperproliferative or benign dysprolif erative disorders e.g. psoriasis 

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. The 

CC animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which can 

CC treat or prevent disorders or diseases of the CNS. The present sequence 

CC is derived by fusing two fragments from rat Nogo C and Nogo A proteins. 

CC The fragment is used in the construction of mutant Nogo-C which is 

CC composed of His-tag/T7-tag/Nogo-C N-terminus (11 aa) + Nogo-A sequence aa 

CC 975-1162. Nogo A deletion mutants were used for mapping the inhibitory 

CC sites of Nogo protein. Major inhibitory region was identified in the Nogo 

CC A sequence from amino acids 172-974, particularly amino acids 542-722. In 

CC addition, N-terminal region 1-171 was found to be inhibitory to NIH 3T3 

CC fibroblast spreading. Note: The present sequence is not given in the 

CC specification but is derived from rat Nogo C sequence shown in AAY71312 

CC and Nogo A sequence shown in AAY71310. SEQ ID numbers 35-42 are referred 

CC in claim 32 and SEQ ID NO: 29 in disclosure of the specification. 

CC However, the specification does not include sequences for these SEQ ID 

CC numbers 

XX 

SQ Sequence 199 AA; 



Query Match 70.9%; Score 499.5; DB 3; Length 199; 

Best Local Similarity 98.1%; Pred. No. 1.7e-50; 

Matches 103; Conservative 0; Mismatches 1; Indels 1; Gaps 1; 
Qy 38 MDGQKKHWKDK-WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTI 96 

I I I I I I I I I M I I M I I M I I I I I M I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I 

1 ^DGQKKHWKDKSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTI 60 

Qy 97 SFRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I I I I I I M I M I I I M I I I I I I I M I I 

61 SFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 105 



RESULT 12 
AAY71311 

ID AAY71311 standard; protein; 1178 AA. 
XX 

AC AAY71311; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Human neurite growth inhibitor Nogo . 
XX 

KW Human; neurite growth inhibitor; Nogo; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperproliferative disorder; benign dysprolif erative disorder; diagnosis 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening. 

XX 

OS Homo sapiens. 
XX 

f'H Key Location/Qualifiers 

FT Misc-dif ference 187 

FT /label= Unknown 

FT Misc-dif ference 188 

FT /labels Unknown 

FT Misc-dif ference 189 

FT /label= Unknown 

FT Misc-dif ference 190 

FT /label= Unknown 

FT Misc-dif ference 221 

FT /label= Unknown 

FT Misc-difference 328 

FT /label= Unknown 

FT Misc-difference 477 

FT /label= Unknown 

FT Region 977. .1012 

FT /note= "Region specifically described in claim 16" 

FT Region 994. .1174 

FT /note= "Region specifically described in claim 16" 

FT Region 1079. .1114 

/note= "Region specifically described in claim 16" 

XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160 . 
XX 

PR 06-NOV-1998; 98US-0107446P . 
XX 

PA (SCHW/) SCHWAB M E. 
PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 



XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Claim 11; Fig 13; 122pp; English. 
XX 

CC The present sequence is a human Nogo protein which is a potent neural 

CC cell growth inhibitor and is free of all central nervous system (CNS) 

CC myelin material with which it is natively associated. The human Nogo 

CC sequence was derived by aligning human expressed sequence tags (ESTs) 

CC e.g. AA158636, AA333267, AA081783, AA167765, AA322918, AA092565, AA081525 

CC and AA081840 with the rat Nogo sequence. Nogo proteins and fragments 

CC displaying neurite growth inhibitory activity are used in the treatment 

CC of neoplastic disease of the CNS e.g. glioma, glioblastoma, 

CC medulloblastoma, craniopharyngioma, ependyoma, pinealoma, 

CC haemangioblastoma, acoustic neuroma, oligodendroglioma, menagioma, 

CC neuroblastoma or retinoblastoma and degenerative nerve diseases e.g. 

CC Alzheimer's and Parkinson's diseases. Therapeutics which promote Nogo 

CC activity can be used to treat or prevent hyperprolif erative or benign 

CC dysproliferative disorders e.g. psoriasis and tissue hypertrophy. 

CC Ribozymes or antisense Nogo nucleic acids can be used to inhibit 

CC production of Nogo protein to induce regeneration of neurons or to 

CC promote structural plasticity of the CNS in disorders where neurite 

CC growth, regeneration or maintenance are deficient or desired. The animal 

CC models can be used in diagnostic and screening methods for predisposition 

CC to disorders and to screen for or test molecules which can treat or 

CC prevent disorders or diseases of the CNS. Note: SEQ ID numbers 35-42 are 

CC referred in claim 32 and SEQ ID NO: 29 in disclosure of the 

CC specification. However the specification does not include sequences for 

CC these SEQ ID numbers 

XX 

SQ Sequence 1178 AA; 

Query Match 63.5%; Score 448; DB 3; Length 1178; 

Best Local Similarity 74.6%; Pred. No. 2.2e-43; 

Matches 97; Conservative 3; Mismatches 8; Indels 22; Gaps 1; 

Qy 12 RENFAVYSVSVGMHNLLLLEGRSWQEMDGQKKHWKDKVVDLLYWRDIKKTGWFGASLFL 71 
I I - i =1 I I I I I I I I I I I I I I I I I I I I I I I I 

976 RSPSAIFSADLG KTSWDLLYWRDIKKTGWFGASLFL 1013 

Qy 72 LLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIAKSDEGHPFRAYLESEVAISEEL 131 

M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1014 LLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEEL 1073 

Qy 132 VQKYSNSALG 141 

I I I I I I I I I I 
Db 1074 VQKYSNSALG 1083 



RESULT 13 
AAY71563 

ID AAY71563 standard; protein; 403 AA. 
XX 

AC AAY71563; 
XX 

DT 02-NOV-2000 (first entry) 



XX 

DE Rat Nogo A protein fragment used in the construction of mutant EST. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer^s disease; Parkinson's disease; 

KW hyperproliferative disorder; benign dysprolif erative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; mutant; mutein. 

XX 

OS Rattus sp. 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160 , 
XX 

PR 06-NOV-1998; 98US-010744 6P . 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Example; Page; 122pp; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of all 

CC central nervous system (CNS) myelin material with which it is natively 

CC associated. Nogo proteins and fragments displaying neurite growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblas toma, acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperproliferative or benign dysprolif erative disorders e.g. psoriasis 

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. The 

CC animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which can 

CC treat or prevent disorders or diseases of the CNS. The present sequence 

CC is a fragment of rat Nogo A protein shown in AAY71310, which is used in 

CC the construction of mutant EST. The mutant is composed of His-tag/T7- 

CC tag/Nogo-A sequence aa 760-1162. Nogo A deletion mutants were used for 

CC mapping the inhibitory sites of Nogo protein. Major inhibitory region was 

CC identified in the Nogo A sequence from amino acids 172-974, particularly 

CC amino acids 542-722. In addition, N-terminal region 1-171 was found to be 

CC inhibitory to NIK 3T3 fibroblast spreading. Note: The present sequence is 



CC not given in the specification but is derived from rat Nogo A sequence 

CC shown in AAY71310. SEQ ID numbers 35-42 are referred in claim 32 and SEQ 

CC ID NO: 29 in disclosure of the specification. However, the specification 

CC does not include sequences for these SEQ ID numbers 
XX 

SQ Sequence 403 AA; 

Query Match 63,4%; Score 447; DB 3; Length 403; 

Best Local Similarity 96.9%; Pred. No. 6.9e-44; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0 

Qy 4 6 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 214 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIAIiALLSVTISFRIYKGVI 273 



Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 





1 1 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 


274 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 309 


RESULT 14 




AAY95012 




ID 


AAY95012 standard; protein; 893 AA. 


XX 






AC 


AAY95012; 




XX 






DT 


19-JUN-2000 


(first entry) 


XX 






DE 


Human secreted protein vb22 1, SEQ ID NO: 64. 


XX 






KW 


Human; secreted protein; cancer; tumour; cardiovascular disorder; 


KW 


blood disorder; haemophilia; autoimmune disease; diabetes; inflammation; 


KW 


infection; fungal; bacterial; viral; HIV; allergy; arthritis; 


KW 


neurodegenerative disease; asthma; contraceptive. 


XX 






OS 


Homo sapiens. 


XX 






PN 


WO200011015- 


-Al. 


XX 






PD 


02-MAR-2000 




XX 






PF 


24-AUG-1999; 99WO-US019351 . 


XX 






PR 


24-AUG-1998 


; 98US-0097638P. 


PR 


24-AUG-1998 


; 98US-G097659P. 


PR 


09-SEP-1998, 


; 98US-0099618P. 


PR 


28-SEP-1998, 


98US-0102092P. 


PR 


25-NOV-1998, 


98US-0109978P. 


PR 


23-DEC-1998, 


98US-0113645P. 


PR 


23-DEC-1998, 


98US-0113646P. 


PR 


23-AUG-1999, 


99US-00379246. 


XX 






PA 


(ALPH-) ALPHAGENE INC. 


XX 






PI 


Valenzuela D, Yuan 0, Hoffman H, Hall J, Rapiejko P; 


XX 






DR 


WPI; 2000-224657/19. 



XX 

PT New secreted or transmembrane proteins and polynucleotides encoding them, 

PT useful for treating neurodegenerative disorders, autoimmune diseases and 

PT cancer. 
XX 

PS Claim 73; Page 322-325; 357pp; English. 
XX 

CC The invention relates to 40 human secreted proteins (AAY94981-Y95020) , 

CC and cDNA sequences encoding them (AAA23423-A23462 ) . The secreted proteins 

CC of the invention include those that are thought to be only partially 

CC secreted, i.e., transmembrane proteins. The proteins of the invention may 

CC exhibit one or more activities selected from the following: cytokine 

CC activity; cell proliferation; differentiation; immune modulation; 

CC haematopoiesis regulation; tissue growth activity; activin/inhibin 

CC activity; chemotactic/chemokinetic activity; haemostatic and thrombolytic 

CC activity; anti-inflammatory activity; and tumour inhibition activity. The 

CC proteins may be administered to patients as vaccines, and the nucleotides 

CC may be used as part of a gene therapy regime. Diseases or conditions that 

CC may be treated using the proteins or nucleotides of the invention include 

CC autoimmune diseases; genetic disorders; haemophilia; cardiovascular 

CC diseases; cancer; bacterial, fungal and viral infections, especially HIV; 

CC multiple sclerosis; rheumatoid arthritis; pulmonary inflammation; 

CC Guillain-Barre syndrome; insulin dependent diabetes mellitus; and 

CC allergic reactions such as asthma and anaemia. They may also be used for 

CC treating wounds, burns, ulcers, osteoporosis, osteoarthritis, periodontal 

CC diseases, Alzheimer's disease, Parkinson's disease, Huntington's disease 

CC and amyotrophic lateral sclerosis (ALS). Proteins with activin/inhibin 

CC activity may additionally be useful as contraceptives. Nucleic acid 

CC sequences of the invention may be used in chromosome mapping, and as a 

CC source of diagnostic primers and probes. The present sequence represents 

CC one of the 40 proteins of the invention 

XX 

SQ Sequence 893 AA; 

Query Match 63.4%; Score 447; DB 3; Length 893; 

Best Local Similarity 96.9%; Pred. No. 2e-43; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 4 6 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 703 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 762 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III I M I I I I II II M I M I I I I I M I II II I I II 
Db 763 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNS7VLG 798 



RESULT 15 
ABU11573 

ID ABU11573 standard; protein; 983 AA. 
XX 

AC ABU11573; 
XX 

DT 12-FEB-2003 (first entry) 
XX 

DE Human MDDT polypeptide SEQ ID 520. 
XX 



KW MDDT; human; disease detection and treatment molecule polypeptide; 

KW anti-inflammatory; immunosuppressive; osteopathic; cytostatic; anti~HIV; 

KW haemostatic; nephrotropic; antianaemic; antipsoriatic; hepatotropic; 

KW gene therapy; protein replacement therapy; cell proliferative disorder; 

KW cancer; adenocarcinoma; leukaemia; lymphoma; melanoma; myeloma; sarcoma; 

KW anaemia; Crohn's disease; acquired immunodeficiency syndrome; AIDS; 

KW Goodpasture's syndrome; inflammation; osteoporosis; thrombocytopaenia; 

KW psoriasis; hepatitis. 

XX 

OS Homo sapiens. 
XX 

PN WO200279449-A2. 
XX 

PD lO-OCT-2002. 
XX 

PF 27-MAR-2002; 2002WO-US009944 . 
XX 

PR 28-MAR-2001; 2001US-0279619P . 

PR 29-MAR-2001; 2001US-0280067P . 

PR 29-MAR-2001; 2001US-0280068P . 

PR 16-MAY-2001; 2001US-0291280P . 

PR 17-MAY-2001; 2001US-0291829P . 

PR 17-MAY-2001; 2001US-0291849P . 

PR 19-JUN-2001; 2001US-0299428P . 

PR 20-JUN-2001; 2001US-029977 6P . 

PR 20-JUN-2001; 2001US-0300001P . 
XX 

PA (INCY-) INCYTE GENOMICS INC. 
XX 

PI Daffo A, Jones AL, Tran AB, Dahl CR, Gietzen D, Chinn J; 

PI Dufour GE, Hillman JL, Yu JY, Tuason O, Yap PE, Amshey SR; 

PI Daugherty SC, Dam TC, Liu TF, Nguyen DA, Kleefeld Y, Gerstin EH; 

PI Peralta CH, David MH, Lewis SA, Chen AJ, Panzer SR, Harris B; 

PI Flores V, Marwaha R, Lo A, Lan RY, Urashka ME; 

XX 

DR WPI; 2003-058431/05. 

DR N-PSDB; ABX34563. 
XX 

PT New purified disease detection and treatment molecule proteins and 

PT polynucleotides, useful for diagnosing, treating or preventing cancers 

PT (e.g. leukemia or sarcoma), anemia, Crohn's disease, AIDS, osteoporosis 

PT or hepatitis. 

XX 

PS Claim 27; SEQ ID NO 520; 339pp + Sequence Listing; English. 
XX 

CC This invention describes a novel disease detection and treatment molecule 

CC polypeptide (MDDT) which has anti-inflammatory, immunosuppressive, 

CC osteopathic, cytostatic, anti-HIV, haemostatic, nephrotropic, 

CC antianaemic, antipsoriatic and hepatotropic activity. The polynucleotides 

CC and the polypeptides of the invention can be used for gene therapy, 

CC protein replacement therapy and are useful for treating a variety of 

CC diseases or conditions. These polypeptides or polynucleotides are 

CC particularly useful for diagnosing, treating or preventing cell 

CC proliferative disorders (e.g. cancers including adenocarcinoma, 

CC leukaemia, lymphoma, melanoma, myeloma or sarcoma), anaemia, Crohn's 

CC disease, acquired immunodeficiency syndrome (AIDS), Goodpasture's 

CC syndromes, inflammation, osteoporosis, thrombocytopaenia, psoriasis or 



CC hepatitis. ABU11450-ABU11845 represent the MDDT polynucleotides encoded 

CC by ABU11450-7VBU11845, described in the disclosure of the invention. NOTE: 

CC The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format from WIPO at 

CC ftp . wipo , int/pub/ published_pct_sequences 

XX 

SQ Sequence 983 AA; 

Query Match 63.4%; Score 447; DB 6; Length 983; 

Best Local Similarity 96.9%; Pred. No. 2.3e-43; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 46 KDKWDLLYWRDIKKTGWFGA3LFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 793 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 852 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III I I II II I II I I I I I I I I I I I I I I I II I I I I I I 
Db 853 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 888 



Search completed: September 29, 2004, 18:14:02 
Job time : 13.3745 sees 



GenCore version 5,1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

on: September 29, 2004, 18:09:33 ; Search time 3.50035 Seconds 

(without alignments) 
2079.581 Million cell updates/sec 

Title: US-09-8 3 0-972-32 

Perfect score: 705 

Sequence: 1 QASGEAGVSCLRENFAVYSV ESEVAISEELVQKYSNSALG 141 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 389414 seqs, 51625971 residues 

Total number of hits satisfying chosen parameters: 389414 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Issued_Patents_AA: * 

1 : /cgn2_6/ptodata/2/iaa/5A_COMB . pep : * 

2 : /cgn2_6/ptodata/2/iaa/5B_COMB.pep : * 

3 : / cgn2_6/ptodata/2/iaa/6A_COMB.pep: * 

4 : / cgn2_6/ptodata/2/iaa/6B_COMB.pep : * 

5 : /cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep: * 

6: / cgn2_6/ptodata/2/iaa/backf lies 1 .pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


503 


71. 


3 


199 


2 


us- 


08- 


700-607-1 


Sequence 


1, Appli 


2 


349 


49. 


5 


208 


2 


us- 


08- 


700-607-7 


Sequence 


7, Appli 


3 


348 


49. 


4 


267 


2 


us- 


08- 


700-607-8 


Sequence 


8, Appli 


4 


337 


47. 


8 


356 


2 


us- 


08- 


700-607-6 


Sequence 


6, Appli 


5 


337 


47. 


8 


776 


2 


us- 


08- 


700-607-5 


Sequence 


5, Appli 


6 


305 


43. 


3 


241 


2 


us- 


08- 


700-607-3 


Sequence 


3, Appli 


7 


227 


32. 


2 


168 


4 


us- 


09- 


149-476-563 


Sequence 


563, App 


8 


99 


14. 


0 


80 


3 


us- 


08- 


905-223-411 


Sequence 


411, App 


9 


75 


10. 


6 


593 


4 


us- 


09- 


328-352-4866 


Sequence 


4866, Ap 


10 


72.5 


10. 


3 


598 


2 


us- 


08- 


853-659A-53 


Sequence 


53, Appl 


11 


71.5 


10. 


1 


154 


1 


us- 


08- 


366-783-5 


Sequence 


5, Appli 



12 


70 


9 


. 9 


518 


4 


us- 


-09-134-001C-4744 


Sequence 


4744^ Ap 


13 


70 


9 


.9 


563 


4 


us- 


-09-422-936-79 


Sequence 


79, AddI 


14 


70 


9 


.9 


619 


3 


US- 


-08-262-220-6 


Sequence 


6, Appli 


15 


70 


9 


. 9 


619 


3 


US- 


'08-471-733-6 


Sequence 


6 Appli 


16 


70 


9 


.9 


619 


3 


US- 


-08-468-878-6 


Sequence 


6, Appli 


17 


70 


9 


. 9 


619 


3 


us- 


-08-750-494-6 


Sequence 


6, Appli 


18 


70 


9 


.9 


619 


4 


us- 


-08-470-638-6 


Sequence 


6^ Appli 


19 


70 


9 


.9 


844 


4 


us- 


-09-422-936-47 


Sequence 


47, AppI 


20 


70 


9 


.9 


844 


4 




-09-422-936-51 


Sequence 


51, AppI 


21 


70 


9 


.9 


886 


4 


us- 


-09-422-936-77 


Sequence 


77, AppI 


22 


70 


9 


. 9 


892 


4 


us- 


-09-422-936-75 


Sequence 


7 S Ann! 


23 


70 


9 


.9 


899 


4 


US- 


'09-422-936-71 


S equence 


7 1 Ann 1 


24 


70 


9 


.9 


960 


4 


us- 


-09-422-936-45 


Sequence 


4 Ann 1 


25 


70 


9 


9 


961 


4 


us- 


-09-422-936-49 


Sequence 


4 9 Ann 1 


26 


70 


9 


9 


961 


4 


us- 


09-914-259-14 


Sequence 


1 4 Annl 


27 


69 


9 


8 


621 


3 


us- 


08-262-220-8 


Semi en r'e 




28 


69 


9 


8 


621 


3 


us- 


08-471-733-8 


S equence 


8 Ann1 i 


29 


69 


9 


8 


621 


3 


us- 


08-468-878-8 


Sequence 


8 Annl i 


30 


69 


9 


8 


621 


3 


us- 


08-750-494-8 


kJ v3- LI. ^ H V_» C 




31 


69 


9, 


8 


621 


4 


us- 


08-470-638-8 


S F» m ] p'n r'fi 

vz;x i v3 


R Annl i 


32 


68 


9. 


6 


344 


4 


us- 


09-107-532A-68 8 6 


5^ m 1 "n i=i 
tj c \^ Li c; 1 i c 


^ftPf% Ar>i 


33 


68 


9. 


6 


1447 


3 


us- 


09-041-886-25 






34 


68 


9. 


6 


1447 


5 


PCT 


-US94-05277-2 




9 Annl 1 


35 


67.5 


9. 


6 


659 


4 


us- 


09-328-352-6021 


^ U. C. i i o " 


f^C\9^ An 


36 


67 


9. 


5 


231 


4 


us- 


09-198-452A-419 


SecTuence 


4 T 9 Ann 


37 


67 


9. 


5 


493 


4 


us- 


09-540-236-2120 


SecTuen ce 


2120 An 


38 


67 


9. 


5 


507 


4 


us- 


09-328-352-7742 


5^ m 1 n (=> 

^ \J U ^11 vZI 


77 4 9 An 


39 


66.5 


9. 


4 


249 


4 


us- 


09-107-532A-67 06 




f^7nf^ An 


40 


66 


9. 


4 


445 


4 


us- 


09-328-352-4714 


Sequence 


4714, Ap 


41 


65.5 


9. 


3 


187 


2 


us- 


08-846-021A-5 


Sequence 


5, Appli 


42 


65.5 


9. 


3 


234 


1 


us- 


08-366-783-4 


Sequence 


4, Appli 


43 


65.5 


9. 


3 


254 


2 


us- 


08-767-026-7 


Sequence 


1, Appli 


44 


65.5 


9. 


3 


254 


4 


us- 


09-319-275A-7 


Sequence 


7, Appli 


45 


65 


9. 


2 


424 


4 


us- 


09-543-681A-7510 


Sequence 


7510, Ap 



ALIGNMENTS 



RESULT 1 
US-08-700-607-1 

; Sequence 1, Application US/08700607 

; Patent No. 5858708 

; GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 

APPLICANT: Au-Young, Janice 

APPLICANT: Goli, Surya K. 

APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
; CITY: Palo Alto 

STATE: CA 
COUNTRY: U.S. 
; ZIP: 94304 



COMPUTER RE7VDABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
; OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700, 607 
FILING DATE: Filed Herewith 
; ATTORNEY/AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

; REGISTRATION NUMBER: 36,749 

REFERENCE/ DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 199 amino acids 

TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
; LIBRARY: 
; CLONE: Consensus 

US-08-700-607-1 



Query Match 71.3%; Score 503; DB 2; Length 199; 

Best Local Similarity 98.1%; Pred. No. 2.9e-52; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps C 

MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 
I I I I I I H I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
MDGQKKNWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYI7VIALLSVTIS 60 

FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 
M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Qy 


38 


Db 


1 


Qy 


98 


Db 


61 



RESULT 2 
US-08-700-607-7 

Sequence 1 , Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Sandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICT^NT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 



ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J, 
REGISTRATION NUMBER: 36,749 
REFERENCE/ DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-8 55-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 208 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 307311 
US-08-700-607-7 

Query Match 49.5%; Score 349; DB 2; Length 208; 

Best Local Similarity 63,4%; Pred. No. 8.6e-34; 

Matches 64; Conservative 19; Mismatches 18; Indels 0; Gaps 0 

Qy 37 EMDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTI 96 

Ml : I I : : I I I I I I I I I : I I : I I I : I I I Ml M : I I I I I : I I I II II 

Db 9 KMDCWSNWKSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLAIAALSATI 68 

Qy 97 SFRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSN 137 

I I I II I I : I I : I : I I I I I I : I I I I I : : I : I : I M : : 
Db 69 SFRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTD 109 



RESULT 3 
US-08-700-607-8 

Sequence 8, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 



COUNTRY: U.S. 
; ZIP: 94304 

COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
; SOFTWARE: FastSEQ Version 1.5 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
; ATTORNEY/AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 8: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 2 67 amino acids 
; TYPE: amino acid 

; STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
; LIBRARY: GenBank 

; CLONE: 281046 

US-08-700-607-8 



Query Match 49.4%; Score 348; DB 2; Length 267; 

Best Local Similarity 64.0%; Pred. No, 1.6e-33; 

Matches 64; Conservative 18; Mismatches 18; Indels 0; Gaps ( 

Qy 38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 

II : I I : : I I M II I II : II : I I I : I II I II I I : I II I I : II I II Ml 

Db 1 MDCVWSNWKSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATIS 60 

Qy 98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSN 137 

I I I I I I : II : I : I I I I M : I I I I I : : I : I : I I I : : 
Db 61 FRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTD 100 



RESULT 4 
US-08-700-607-6 

Sequence 6, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au-Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L, 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 



STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 356 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 307309 
US-08-700-607-6 

Query Match 47.8%; Score 337; DB 2; Length 356; 

Best Local Similarity 67.4%; Pred. No. 5e-32; 

Matches 62; Conservative 16; Mismatches 14; Indels 0; Gaps 0 

Qy 46 KDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I : i I I I I I I I I : I I : I I I : I II III I I : I I I I I : | || | | I I I I I I || | : 
Db 166 KQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 225 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSN 137 

II: I : I I I I M : II II I : : I : I : || I : : 
Db 226 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTD 257 



RESULT 5 
US-08-700-607-5 

Sequence 5, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INAAENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 



; CITY: Palo Alto 

; STATE: CA 

COUNTRY : U.S. 
ZIP: 94304 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
; OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/700,607 

FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 776 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
; MOLECULE TYPE: peptide 

IMMEDIATE SOURCE: 
; LIBR7VRY: GenBank 

CLONE: 307307 
US-08-700-607-5 



Query Match 47.8%; Score 337; DB 2; Length 776; 

Best Local Similarity 67.4%; Pred. No. 1.5e-31; 

Matches 62; Conservative 16; Mismatches 14; Indels 0; Gaps 0; 

Qy 46 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

_ I I :|||||||||:||:|M: I II III 11:111 11:111 II Mlliiii I: 
Db 586 KQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 645 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSN 137 

II: I : I I II I I : I II I I : : I : I : || | :: 
Db 64 6 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTD 677 



RESULT 6 
US-Q8-700-607-3 

Sequence 3, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 



STREET; 3174 Porter Drive 

CITY: Palo Alto 

STATE: CA 

COUNTRY: U.S. 

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/700,607 

FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 

NAME: Billings, Lucy J. 
; REGISTRATION NUMBER: 36,749 

REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
; INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 241 amino acids 

TYPE: amino acid 

STRANDEDNESS: single 
; TOPOLOGY: linear 

MOLECULE TYPE: peptide 
; IMMEDIATE SOURCE: 

LIBRARY: THPlNOBOl 

CLONE: 31870 
US-08-700-607-3 



Query Match 43.3%; Score 305; DB 2; Length 241; 

Best Local Similarity 60.9%; Pred, No. 2e-28; 

Matches 56; Conservative 17; Mismatches 19; Indels 0; Gaps 0 

Qy 49 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAI 108 

I II- I I I : I I M III : I : I I I I I I : : II : I : I I I I II I I I | | M I Mil: 
48 VHDLIXWRDVKKTGFVFGTTLIMLLSLAAFSVISWSYLILALLSVTISFRIYKSVIQAV 107 

Qy 109 AKSDEGHPFRAYLESEVAISEELVQKYSNSAL 14 0 

I I : I I I M : I I I : : : : I I I I : | : 

Db 108 QKSEEGHPFKAYLDVDITLSSEAFHNYMNAAM 139 



RESULT 7 

US-09-149-476-563 

; Sequence 563, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al. 

; TITLE OF INVENTION: 186 Human Secreted proteins 
; FILE REFERENCE: PZ002P1 

CURRENT APPLICATION NUMBER: US/09/149,476 
; CURRENT FILING DATE: 1998-09-08 

EARLIER APPLICATION NUMBER: PCT/US98/04 493 
; EARLIER FILING DATE: 1998-03-06 



EARLIER APPLICATION NUMBER: 60/040,162 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,333 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/038,621 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,626 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,334 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,336 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/040,163 
EARLIER FILING DATE: 1997-03-07 
EARLIER APPLICATION NUMBER: 60/047,600 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,615 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,597 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,502 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,633 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,583 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,617 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,618 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,503 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
E7VRLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 



EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,58 0 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 



EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,589 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,593 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,614 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,578 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,576 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/047,501 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,670 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/056,632 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,664 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,876 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,881 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,909 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,875 



; EARLIER FILING DATE: 1997-08-22 

; E7VRLIER APPLICATION NUMBER: 60/056, 862 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,887 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,908 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/048,964 

; EARLIER FILING DATE: 1997-06-06 

; EARLIER APPLICATION NUMBER: 60/057,650 

; EARLIER FILING DATE: 1997-09-05 

; EARLIER APPLICATION NUMBER: 60/056,884 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/057,669 

; E7VRLIER FILING DATE: 1997-09-05 

; EARLIER APPLICATION NUMBER: 60/049,610 

; EARLIER FILING DATE: 1997-06-13 

; EARLIER APPLICATION NUMBER: 60/061,060 

; EARLIER FILING DATE: 1997-10-02 

Query Match 32.2%; Score 227; DB 4; Length 168; 

Best Local Similarity 60.0%; Pred. No. 2.7e-19; 

Matches 42; Conservative 14; Mismatches 14; Indels 0; Gaps 0 

Qy 71 LLLSLTVFSIVSVTAYI7UALLSVTISFRIYKGVIQAIAKSDEGHPFRAYLESEVAISEE 130 

HIM M : : I I : I : I I I I I I I I I I I I M Mil: I h II II M II I : : : : I I 
Db 1 MLLSLAAFSVISWSYLILALLSVTISFRIYKSVIQAVQKSEEGHPFKAYLDVDITLSSE 60 

Qy 131 LVQKYSNSAL 140 

I I : I : 

Db 61 AFHNYMNAAM 7 0 



RESULT 8 

US-08-905-223-411 

; Sequence 411, Application US/08905223 

; Patent No. 6222029 

; GENERAL INFORMATION: 

; APPLICANT: Edwards, Jean-Baptiste D. 

APPLICANT: Duelert, Aymeric 
; APPLICANT: Lacroix, Bruno 

TITLE OF INVENTION: 5' ESTs FOR SECRETED PROTEINS 
NUMBER OF SEQUENCES; 503 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 
STREET: 501 West Broadway 
CITY: San Diego 
STATE: California 
COUNTRY: USA 
; ZIP: 92101-3505 

; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy Disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: Win95 
SOFTWARE: Word 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/ 905 , 223 



FILING DATE: 
CLASSIFICATION: 536 
ATTORNEY/ AGENT INFORMATION: 
NAME: Israelsen, Ned A. 
REGISTRATION NUMBER: 29,655 
REFERENCE/DOCKET NUMBER: 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (619) 235-8550 
TELEFAX: (619) 235-0176 
INFORMATION FOR SEQ ID NO: 411: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 80 amino acids 
TYPE: AMINO ACID 
TOPOLOGY: LINEAR 
MOLECULE TYPE: PROTEIN 
ORIGINAL SOURCE: 

ORGT^NISM: Homo Sapiens 
TISSUE TYPE: Brain 
FEATURE : 

NAME/ KEY: sig_peptide 
LOCATION: -78,.-l 

IDENTIFICATION METHOD: Von Heijne matrix 
OTHER INFORMATION: score 6.3 
OTHER INFORMATION: seq TLIMLLSWQLSVS/SV 
US-08-905-223-411 

Query Match 14.0%; Score 99; DB 3; Length 80; 

Best Local Similarity 58.8%; Pred, No. 0.0002; 

Matches 20; Conservative 7; Mismatches 5; Indels 2; Gaps 

Qy 49 WDLLYWRDIKKTGWFGASLFLLLS — LTVFSI 8 0 

I I I : : I ! I : I I I I III : I : I I I I : I I : 
Db 47 VHDLIFWRDVKKTGFVFGTTLIMLLSWQLSVSSV 80 



RESULT 9 

US-09-328-352-4866 

; Sequence 4866, Application US/09328352 

; Patent No. 6562958 

; GENERAL INFORMATION: 

; APPLICANT: Gary L. Breton et al . 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
ACINETOBACTER 

; TITLE OF INVENTION: BAUMANNII FOR DIAGNOSTICS AND THERAPEUTICS 
; FILE REFERENCE: GTC99-03PA 

; CURRENT APPLICATION NUMBER: US/ 09/32 8 , 352 

CURRENT FILING DATE: 1999-06-04 
; NUMBER OF SEQ ID NOS: 8252 
; SEQ ID NO 4866 

LENGTH: 593 

TYPE: PRT 

; ORGANISM: Acinetobacter baumannii 
US-09-328-352-4866 

Query Match 10.6%; Score 75; DB 4; Length 593; 

Best Local Similarity 27.5%; Pred. No. 2.5; 

Matches 25; Conservative 19; Mismatches 27; Indels 20; Gaps 



Qy 



55 



WRDI KKT — GWFGAS LFLLLS LTVFS IV SVTAYI ALALLS VT I S FRI YKGVI QA 107 



Db 



258 



I I • I • I I • I I - • • - I i I I . I . I . I . I j . . I 

WRGVKETWPAVLVGGGAFAIAQYLTSNFIGPELPDITAAIA-SLVSLTLLFRVWK 311 



Qy 



108 



IAKSDEGHPFRAYLESEVAISEE--LVQKYS 136 



Db 



312 



PKHIFRFEPEAGQTLAQQPTTVQRYS 337 



RESULT 10 

US-0 8-853-659A-53 

; Sequence 53, Application US/08853659A 
; Patent No, 5925522 

GENERAL INFORMATION: 

APPLICANT: Wong, K*K.; Saffer, J.D. 
; TITLE OF INVENTION: A Salmonella Sequence, Methods Of Detection 

TITLE OF INVENTION: Of A 

TITLE OF INVENTION: Salmonella Sequence, And Methods Of Detection Of 
Salmonella 

NUMBER OF SEQUENCES: 67 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Paul W. Zimmerman 

; ADDRESSEE: Intellectual Property Services 

; ADDRESSEE: Battelle Memorial Institute 

ADDRESSEE: PNNL P.O. Box 999 

STREET: Washington Way 

CITY: Richland 
; STATE : Washington 

COUNTRY: U.S.A. 

ZIP: 99352 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 3.50", 1.44 Mb storage 

COMPUTER: IBM PC/XT/AT 

OPERATING SYSTEM: MS-DOS 
; SOFTWARE: Word Processor (WordPerfect 5.1) 

; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/853, 659A 

FILING DATE: Unknown 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: none 

FILING DATE: n/a 
; INFORMATION FOR SEQ ID NO: 53: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 598 amino acids 

; TYPE: amino acid 

TOPOLOGY: linear 
US-08-853-659A-53 

Query Match 10.3%; Score 72.5; DB 2; Length 598; 

Best Local Similarity 22.5%; Pred. No. 5; 

Matches 25; Conservative 22; Mismatches 53; Indels 11; Gaps 2; 

Qy 41 QKKHWKDKWDLLYWRDIKKTGWFGASLF-LLLSLTVFSIVSVTAYIALAL 91 

: I h : : I : : I I I : I : : : I I : I : | | 

Db 14 9 KKKNGRSMSSAFVLWNEFQKIKPVLLNSIFQRIADIPIFIIFLIVIYVNLGLWIVPITM 208 



92 —LSVTISFRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSAL 14 0 
: I ^ i I : : : I I I I I I I : I : : : : | | 

Db 209 FIVSIIISLVNHHYTNELMNKQKEGQKNRNIFISEVFLSIKMIHTLNNQGL 259 



RESULT 11 
US-08-366-783-5 

; Sequence 5, Application US/08366783 
; Patent No. 5650554 
GENERAL INFORMATION: 

APPLICANT: Moloney, Maurice M 

TITLE OF INVENTION: Oil-Body Proteins As Carriers 
TITLE OF INVENTION: High-Value Peptides In Plants 
NUMBER OF SEQUENCES: 22 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: DEHLINGER & ASSOCIATES 

STREET: 35 0 CAMBRIDGE AVENUE, SUITE 250 
CITY: PALO ALTO 
STATE: California 
COUNTRY: United States 
; ZIP: 94025-1536 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentin Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/366,783 
; FILING DATE: 

CLASSIFICATION: 800 
ATTORNEY/AGENT INFORMATION: 
NAME: FABIAN, GARY 
REGISTRATION NUMBER: 33,875 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-324-0880 
TELEFAX: 415-324-0960 
INFORMATION FOR SEQ ID NO: 5: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 154 amino acids 
; TYPE: amino acid 

STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
US-08-366-783-5 



Query Match 10.1%; Score 71.5; DB 1; Length 154; 

Best Local Similarity 26.2%; Pred. No. 1; 

Matches 37; Conservative 16; Mismatches 49; Indels 39; Gaps 5 

Qy 33 RSWQEMDGQKKHWKDKWDLLYWRDIKK— TGWFGASLFLLLSLT 7 6 

I =11^ ' II : I I I I I 

^ RDQYQMSGRGSDYSKS RQIAKAATAVTAGGSLLVLLSLTLVGTVIALTVATPL 58 

Qy 77 — VFSIVSVTAYIALALL SVTISFRI YKG-VIQAIAKSDEGHPFRAY 120 

_ HI : I I I :||| ::|: ||| :|: ||: | 

Db 59 LVIFSPILVPALITVALLITGFLSSGGFGIAAITVFSWIYKYLLIEHPQGSDKLDSARMK 118 



Qy 121 LESEVAISEELVQKYSNSALG 141 

II: :: I I I 

Db 119 LGSKAQDLKDRAQYYGQQHTG 139 



RESULT 12 

US-09-134-001C-4744 

; Sequence 4744, Application US/09134001C 
; Patent No. 6380370 
; GENERAL INFORMATION: 

; APPLICANT: Lynn Doucette-Stamm at al 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
STAPHYLOCOCCUS 

; TITLE OF INVENTION: EPIDERMIDIS FOR DIAGNOSTICS AND THERAPEUTICS 
; FILE REFERENCE: GTC-007 

; CURRENT APPLICATION NUMBER: US/ 09/ 134 , OOlC 

; CURRENT FILING DATE: 1998-08-13 

; PRIOR APPLICATION NUMBER: US 60/064,964 

; PRIOR FILING DATE: 1997-11-08 

; PRIOR APPLICATION NUMBER: US 60/055,779 

; PRIOR FILING DATE: 1997-08-14 

; NUMBER OF SEQ ID NOS : 5674 

; SEQ ID NO 4744 

; LENGTH: 518 

; TYPE: PRT 

; ORGANISM: Staphylococcus epidermidis 
US-09-134-001C-4744 

Query Match 9.9%; Score 70; DB 4; Length 518; 

Best Local Similarity 21.0%; Pred. No. 8.1; 

Matches 21; Conservative 28; Mismatches 35; Indels 16; Gaps 4 

Qy 16 AVYSVSVGMH NLLLLEGRSWQEMDGQKKHWKDKWDLLYWRDIKK TGWFGAS 68 

I : : : : : I : | | I : I I : I : I II : I : : : I : I I I 

Db 400 AIVAITIAWHPNDTILNLVGNAWA GFGAAFSPLVLYSLYWKDLTRAGAISGMVAGAV 456 

Qy 69 LFLLLSLTVFSIVSVTAYIAL ALLSVTISFRIYK 102 

: : : : : : : | : : : : | | | : : : | 

Db 457 WIVWISWIKPLATINAFFGMYEIIPGFIVSVLITYIVSK 496 



RESULT 13 
US-09-422--936-79 

; Sequence 79, Application US/09422936 

; Patent No. 6465213 

; GENERAL INFORMATION: 

; APPLICANT: Ekstrand, Jonas 

; TITLE OF INVENTION: NEW NUCLEOTIDE SEQUENCES 

; FILE REFERENCE: 06275-165002 

; CURRENT APPLICATION NUMBER: US/ 09/ 422 , 936 

; CURRENT FILING DATE: 1999-10-22 

; PRIOR APPLICATION NUMBER: US 09/242,608 

; PRIOR FILING DATE: 1999-02-19 

; PRIOR APPLICATION NUMBER: PCT/ SE98/ 0194 7 

; PRIOR FILING DATE: 1998-10-27 

; PRIOR APPLICATION NUMBER: SWEDEN 97 03914-2 



PRIOR FILING DATE: 1997-10-27 

PRIOR APPLICATION NUMBER: SWEDEN 9800864-2 
; PRIOR FILING DATE: 1998-03-16 
; PRIOR APPLICATION NUMBER: SWEDEN 9802575-2 
; PRIOR FILING DATE: 1998-07-17 
; NUMBER OF SEQ ID NOS : 85 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 79 
; LENGTH: 563 
; TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-422-936-79 

Query Match 9.9%; Score 70; DB 4; Length 563; 

Best Local Similarity 28.3%; Pred. No. 9.1; 

Matches 26; Conservative 13; Mismatches 35; Indels 18; Gaps 3 

Qy 30 LEGRSWQEMDGQKKHWKDKWDLLYWRDIKKTGWFGAS LFLLLSLTVFS 79 

I : I I : : : : : I III II II I II : I 

Db 144 LQGGSYKKIG YYDSTKDDLSW SKTDKWIGGSPPADQTLVIKTFRFLSQKLFI 195 

Qy 80 IVSVTAYIALALLSVTISFRIYKGVIQAIAKS 111 

I II : : : I I : I I II : : I I 
Db 196 SVSVLSSLGIVLAWCLSFNIYNSHVRYIQNS 227 



RESULT 14 
US-08-262-220-6 

; Sequence 6, Application US/08262220 
; Patent No. 6054296 

GENERAL INFORMATION: 

APPLICANT: BERGSTROM SVEN 
APPLIC7\NT: BARBOUR ALAN G. 
; TITLE OF INVENTION: NEW 66 KDA ANTIGEN FROM BORRELIA 
; NUMBER OF SEQUENCES: 14 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: BROWDY AND NEIMARK 

STREET: 419 SEVENTH STREET, N.W. 
; CITY: WASHINGTON 

COUNTRY: USA 
ZIP: 20004 
COMPUTER READ7VBLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentin Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/262, 220 
FILING DATE: 20-JUN-1994 
; CLASSIFICATION: 514 

ATTORNEY/AGENT INFORMATION: 
NAME: COOPER, IVER P. 
REGISTRATION NUMBER: 28,005 
; REFERENCE/DOCKET NUMBER: BERGSTR0M=3 

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 202-628-5197 
; TELEFAX: 202-737-3528 



TELEX: 248633 
; INFORMATION FOR SEQ ID NO: 6: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 619 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
; MOLECULE TYPE: protein 
US-08-262-220-6 

Query Match 9.9%; Score 70; DB 3; Length 619; 

Best Local Similarity 39.0%; Pred. No. 10; 

Matches 16; Conservative 6; Mismatches 15; Indels 4; Gaps 1; 

Qy 32 GRSWQEMDGQKKHWKDKWD LLYWRDIKKTGWFGAS 68 

I : I : I I : I : I I I I I : I I : I I I I 

Db 384 GLAWNKDDGEKESWKVKGSDSYSTRLFGEQDKKSGVALGIS 424 



RESULT 15 
US-08-471-733-6 

; Sequence 6, Application US/08471733 

; Patent No. 6068842 

; GENERAL INFORMATION: 

APPLIC7\NT: BERGSTROM SVEN 
APPLICANT: BARBOUR ALAN G. 
; TITLE OF INVENTION: NEW 66 KDA ANTIGEN FROM BORRELIA 
NUMBER OF SEQUENCES: 14 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: BROWDY AND NEIMARK 

STREET: 419 SEVENTH STREET, N.W. 
; CITY: WASHINGTON 

COUNTRY: USA 
ZIP: 20004 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: Patentin Release #1.0, Version #1.25 

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/08/471,733 

; FILING DATE: 06-JUN-1995 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/262,220 

FILING DATE: 20-JUN-1994 
ATTORNEY/AGENT INFORMATION: 

NAME: COOPER, IVER P. 

REGISTRATION NUMBER: 28,005 

REFERENCE/ DOCKET NUMBER: BERGSTR0M=3 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 202-628-5197 

TELEFAX: 202-737-3528 

TELEX: 248633 
; INFORMATION FOR SEQ ID NO: 6: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 619 amino acids 

; TYPE: amino acid 



TOPOLOGY: linear 
; MOLECULE TYPE: protein 
US-08-471-733-6 

Query Match 9.9%; Score 70; DB 3; Length 619; 

Best Local Similarity 39.0%; Pred. No. 10; 

Matches 16; Conservative 6; Mismatches 15; Indels 

Qy 32 GRSWQEMDGQKKHWKDKWD LLYWRDIKKTGWFGAS 68 

I : I : M : I : I I I I I : I I : I I II 

Db 384 GLAWNKDDGEKESWKVKGSDSYSTRLFGEQDKKSGVALGIS 424 



Search completed: September 29, 2004, 18:21:06 
Job time : 4 . 50035 sees 



GenCore version 5.1.5 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



September 29, 2004, 18:05:43 ; Search time 3.45105 Seconds 

(without alignments) 
3930.111 Million cell updates/sec 

US-09-830-972-32 
705 

1 QASGEAGVSCLRENFAVYSV ESEVAISEELVQKYSNSALG 141 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283366 seqs, 96191526 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283366 



Database 



P1R_78 : ^ 
pirl : * 
pir2 : * 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
160904 

neuroendocrine-specif ic protein C - human 
C; Species : Homo sapiens (man) 

C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999 
C; Accession : 160904 

R;Roebroek, A. J.; van de Velde, H.J.; Van Bokhoven, A.; Broers, J.L.; Ramaeker 
F.C.; Van de Ven, W.J. 

J. Biol. Chem. 268, 13439-13447, 1993 

A; Title: Cloning and expression of alternative transcripts of a novel 
neuroendocrine-specif ic gene and identification of its 135-kDa translational 
product . 

A;Reference number: A46583; MUID : 93293865 ; PMID:7685762 
A; Accession: 1609 04 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-208 <RES> 

A;Cross-references : GB:L10335; NID:g307310; PIDN : AAA59952 . 1 ; PID:g307311 

C; Genetics : 

A; Gene: GDB:RTN1; NSP 



A;Cross-references : GDB:203968; OMIM: 600865 
A;Map position: 14q21-14q22 



Query Match 49.5%; Score 349; DB 2; Length 208; 

Best Local Similarity 63.4%; Pred. No. 5,3e-28; 

Matches 64; Conservative 19; Mismatches 18; Indels 0; Gaps C 

Qy 37 EMDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTI 96 

: I I : I I : : I I I I I I I I I : I I : I I I : I II III I I : I I I I I : I I I II II 

Db 9 KMDCVWSNWKSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYL7VLAALSATI 68 

Qy 97 SFRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSN 137 

II I I I I I : II : I : I M M I : II I I I : : I : I : I I I : : 
Db 69 SFRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTD 109 



RESULT 2 
A60021 

tropomyosin-related protein, neuronal - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 03-Mar-1993 #sequence_revision 03-Mar-1993 #text_change 05-Nov-1999 

C;Accession: A60021 

R;Wieczorek, D.F.; Hughes, S.R. 

Brain Res. Mol . Brain Res. 10, 33-41, 1991 

A; Title: Developmentally regulated cDNA expressed exclusively in neural tissue 
A;Reference number: A60021; MUID : 91278684 ; PMID:1647480 
A;Accession: A60021 
A; Molecule type: mRNA 
A; Residues: 1-267 <WIE> 

A; Cross-references : EMBL:X52 817; NID:g45654 9; PIDN: CAA370 01 .1; PID:g456550 
C; Comment: This neuronal-specif ic mRNA was identified by hybridization to an 
alpha-tropomyosin probe but does not show homology in amino acid sequence. 

Query Match 49.4%; Score 348; DB 2; Length 267; 

Best Local Similarity 64.0%; Pred. No. 8.9e-28; 

Matches 64; Conservative 18; Mismatches 18; Indels 0; Gaps 0 

Qy 38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSA/TIS 97 

II : II : : I II I I I I I I : I I : I I I : I II III I I : I I I I I : I M II III 

Db 1 MDCVWSNWKSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATIS 60 

Qy 98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSN 137 

I I I I I I : I I : I : II M II : I I II I : : I : I : II I : : 
Db 61 FRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTD 100 



RESULT 3 
A46583 

neuroendocrine-specif ic protein, splice form A - human 

N; Contains: neuroendocrine-specif ic protein, splice form B 

C; Species: Homo sapiens (man) 

C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999 
C;Accession: A46583; 160903 

R;Roebroek, A.J.; van de Velde, H.J.; Van Bokhoven, A,; Broers, J.L,; Ramaeker 

F.C. ; Van de Ven, W. J, 

J. Biol. Chem. 268, 13439-13447, 1993 



A; Title: Cloning and expression of alternative transcripts of a novel 
neuroendocrine-specif ic gene and identification of its 135-kDa translational 
product . 

A; Reference nuinber: A46583; MUID: 93293865; PMID: 7685762 
A;Accession: A46583 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-776 <R0E1> 

A;Cross-references: GB:L10333; NID:g307306; PIDN : AAA59950 . 1 ; PID:g307307 
A;Accession: 160903 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: mRNA 

A; Residues: 421-776 <R0E2> 

A; Cross-references: GB:L10334; NID:g307308; PIDN : AAA59 951 . 1 ; PID:g307309 

C; Genetics : 

A; Gene: GDB:RTN1; NSP 

A; Cross-references: GDB:203968; OMIM:600865 
A;Map position: 14q21-14q22 

Query Match 47.8%; Score 337; DB 2; Length 776; 

Best Local Similarity 67.4%; Pred. No. 3.7e-26; 

Matches 62; Conservative 16; Mismatches 14; Indels 0; Gaps 0; 

Qy 46 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I : I I I I I I I I I : I I : I I I : I II III I I : I I I I I : I I I M I I M I I II I : 
Db 586 KQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 645 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSN 137 

II: I : I I I I II : I I I I I : : I : I : I I I : : 
Db 646 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTD 677 



RESULT 4 
T26216 

hypothetical protein W06A7.3c - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct~1999 #text_change 15-Oct-1999 
C; Accession: T2 6216 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A; Reference number: Z20173 
A; Accession: T2 6216 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: DNA 

A; Residues: 1-2484 <WIL> 

A; Cross-references: EMBL:Z78066; PIDN : CAB5 14 67 . 1 ; GSPDB : GN00023 ; CESP : W06A7 . 3c 

A; Experimental source: clone W06A7 

C; Genetics : 

A; Gene: CESP:W06A7 . 3c 

A;Map position: 5 

A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2339/2; 2463/2 

Query Match 23.0%; Score 162; DB 2; Length 2484; 

Best Local Similarity 31.5%; Pred. No. 7.9e-08; 

Matches 34; Conservative 22; Mismatches 48; Indels 4; Gaps 1; 



Qy 25 HNLLLLEGRSWQEMDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVT 84 



Db 2269 HSILKHHGDAWIDF KTVPPCVLDVIYWRDAKKSAIVLSLALLVLFVLAKYPLLTW 2324 

Qy 85 AYIALALLSVTISFRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELV 132 

I I I I I : : I I I I : I III I : : : : I I 

Db 2325 TYSLLI^GAAAGFRVFKKVEAQIKKTDSEHPFSEILAQDLTLPQEKV 2372 



RESULT 5 
T26215 

hypothetical protein W06A7.3a - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Accession: T2 6215 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A; Reference number: Z20173 
A/Accession: T26215 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: DNA 

A; Residues: 1-2607 <WIL> 

A;Cross-references: EMBL:Z78066; PIDN : CAB01522 . 2 ; GSPDB : GN00023 ; CESP : W06A7 . 3a 

A; Experimental source: clone W06A7 

C; Genetics : 

A; Gene: CESP : W06A7 . 3a 

A;Map position: 5 

A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2412/1; 2462/2; 
2586/2 

Query Match 22.6%; Score 159; DB 2; Length 2 607; 

Best Local Similarity 34,5%; Pred. No. 1.7e-07; 

Matches 30; Conservative 19; Mismatches 38; Indels 0; Gaps 0; 

Qy 46 KDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I : I : I : : I I I I I I : : I : I : I I : : : : I I I I I I : : I I 

Db 2409 KKEVLDVIYWRDAKKSAIVLSLALLVLFVTiAKYPLLTVVTYSLLIAiGAAAGFRV 2468 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELV 132 

I I : I III I :: : : I I 
Db 2469 AQIKKTDSEHPFSEILAQDLTLPQEKV 2495 



RESULT 6 
T26213 

hypothetical protein W06A7.3b - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Accession: T2 6213 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A;Reference number: Z20173 
A; Accession: T2 6213 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-222 <WIL> 

A;Cross-references : EMBL:Z78066; PIDN : CAB01523 . 1 ; GSPDB : GN00023 ; CESP : W06A7 . 3b 
A; Experimental source: clone W06A7 



C; Genetics : 
A; Gene: CESP : W06A7 . 3b 
A;Map position: 5 
A;Introns: 27/1; 77/2; 201/2 



Query Match 22.4%; Score 158; DB 2; Length 222; 

Best Local Similarity 34.1%; Pred. No. 1.4e-08; 

Matches 29; Conservative 19; Mismatches 37; Indels 



0; Gaps 



Qy 

Db 



48 KWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQA 107 

I : : I : : 1 I I I I I : : I : I : I I : : : : I I i I I I : : I I 

2 6 KILDVIYWRD7VKKSAIVLSLALLVLFVLAKYPLLTVVTYSLLLALGAAAGFRVFKKVEAQ 85 



Qy 



Db 



108 lAKSDEGHPFRAYLESEVAISEELV 132 

I I : I III I : : : : I I 
86 IKKTDSEHPFSEILAQDLTLPQEKV 110 



RESULT 7 
359439 

probable membrane protein YDR233c - yeast ( Saccharomyces cerevisiae) 
N;Alternate names: hypothetical protein YD9934.17c 
C; Species: Saccharomyces cerevisiae 

C;Date: 30-Nov-1995 #sequence_revision 16-Feb-1996 #text_change 19-Apr-2002 
C; Accession: S5 943 9 
R;Murphy, L. ; Harris, D. 

submitted to the EMBL Data Library, March 1995 
A; Reference number: S59423 
A; Access ion: S59439 
A;Molecule type: DNA 
A; Residues: 1-295 <MUR> 

A;Cross-references: EMBL:Z48612; NID:g728671; PID:g728688; GSPDB : GN00004 ; 
MIPS:YDR233c 

A; Experimental source: strain AB972 

C;Genetics: 

A; Gene: MIPS:YDR233c 

A; Cross-references : SGD: S0002641 

A;Map position: 4R 

C; Keywords: transmembrane protein 

F;4 0-56/Domain: transmembrane #status predicted <TMl> 
F; 146-162/Domain: transmembrane #status predicted <TM2> 

Query Match 11.8%; Score 83.5; DB 2; Length 295; 

Best Local Similarity 26.3%; Pred. No. 0.73; 

Matches 25; Conservative 17; Mismatches 32; Indels 21; Gaps 2 

Qy 41 QKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRI 100 

I : : I Mill: : I I II II III : : : : : I =1 I 
Db 12 QQQQQKSCNCDLLLWRNPVQTGKYFGGSLLALLILKKVNLITFFLKVAYTILFTT 66 

Qy 101 YKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKY 135 

I I: ::| : : : |: || 

Db 67 —GSIEFVSK LFLGQGLITKY 85 



RESULT 8 
C88188 



protein C18H9.5 [imported] - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: lO-May-2001 #sequence_revision lO-May-2001 #text_change lO-May-2001 
C;Accession: C88188 

R; anonymous , The C. elegans Sequencing Consortium, 
Science 282, 2012-2018, 1998 

A; Title: Genome sequence of the nematode C. elegans: a platform for 
investigating biology. 

A; Reference number: A75000; MUID : 99069613; PMID: 9851916 
A;Note: see websites genome.wustl.edu/gsc/C_elegans/ and 
www sanger.ac.uk/Projects/C_elegans/ for a list of authors 

A;Note: published errata appeared in Science 283, 35, 1999; Science 283, 2103, 

1999; and Science 285, 1493, 1999 

A; Access ion: C8 8188 

A; Status: preliminary 

A;Molecule type: DNA 

A; Residues: 1-464 <STO> 

A;Cross-references: GB:chr_II; PID:g722384; GSPDB: GN00020; CESP:C18H9.5 

C; Genetics : 

A; Gene: C18H9.5 

A;Map position: 2 

Query Match 11.6%; Score 81.5; DB 2; Length 464; 

Best Local Similarity 25.9%; Pred. No. 1.9; 

Matches 38; Conservative 28; Mismatches 50; Indels 31; Gaps 8; 

Qy 8 VSCLRENFAVYSVSVGMHNLLLLEGRSWQEMDGQKKHWKDKWDLLYWRDIKKTGWFG- 66 

: : I 1 : : : I | :: : | : I : : I I : I I : I : I I 

Db 41 ITCTNANMILMNFTVICMNDVIIEQKSF SNQTHWLEKSSDI SLTFSAAAVGAIFGT 96 

Qy 67 ASLFLLLS LTVFSIVSV — TAYIALA LLSVTISFRIYKGV 1 105 

|:| III: ::| I : II |: | |: |:::|| I 

Db 97 VPAVTLISKYGIRKVLTVYGLLSAGGTLLMPLAVNYGLIPVLIA-RLFQGVGASILYSSI 155 

Qy 106 QAIAKS DEGHPFRAYLESEVAIS 12 8 

I : : I : I I I : t I II 

Db 156 GTISESWSPINEIGTFVAFLSSAFQIS 182 



RESULT 9 
T24228 

hypothetical protein R166.2 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Accession: T24228 
R; Matthews, P. 

submitted to the EMBL Data Library, August 1995 
A; Reference number: Z19859 
A; Accession: T2422 8 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-618 <WIL> 

A;Cross-references: EMBL:Z50795; PIDN: C7iA90663 . 1 ; GSPDB: GN00020 ; CESP:R166.2 

A; Experimental source: clone R166 

C; Genetics : 

A; Gene: CESP:R166.2 

A;Map position: 2 



A;Introns: 41/3; 86/3; 118/3; 164/3; 207/3; 554/3 



Query Match 11.2%; Score 79; DB 2; Length 618; 

Best Local Similarity 29.1%; Pred. No. 4.7; 

Matches 32; Conservative 22; Mismatches 32; Indels 24; Gaps 6; 

Qy 51 DLLYWRDIKK-TGWFGASLF-LLLSLTVF SIVSVTAYIALAL 91 

I : : I : I I : : I I : I I I I : : I I I I I : 

Db 353 DIQFWNNRKDLVGLSVRSVLFNIFQSLIVFLYICDNETNTMVKVTVGIGLLIECWKIPKV 412 

Qy 92 LSVTISFR-IYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSAL 140 

: : I : I : : : I I I : I I : I : I : I I I I : : : II II 

Db 413 MNVSIDWQNKWFGVIPRLVISDKG SYVESETKIYDQMAFKYLGWAL 458 



RESULT 10 
A72258 

hypothetical protein - Thermotoga maritima (strain MSB8) 
C; Species: Thermotoga maritima 

C;Date: ll-Jun-1999 #sequence_revision ll-Jun-1999 #text_change 21-Jul-2000 
C;Accession: A72258 

R;Nelson, K.E.; Clayton, R.A. ; Gill, S.R.; Gwinn, M.L.; Dodson, R.J.; Haft, 
D.H.; Hickey, E.K.; Peterson, J.D.; Nelson, W.C.; Ketchum, K.A. ; McDonald, L.; 
Utterback, T.R.; Malek, J.A. ; Linher, K.D.; Garrett, M.M. ; Stewart, A.M.; 
Cotton, M.D.; Pratt, M.S.; Phillips, C.A.; Richardson, D. ; Heidelberg, J.; 
Sutton, G.G,; Fleischmann, R.D.; White, 0.; Salzberg, S.L.; Smith, H.O.; Venter, 
J.C.; Eraser, CM. 
Nature 399, 323-329, 1999 

A;Title: Evidence for lateral gene transfer between Archaea and Bacteria from 
genome sequence of Thermotoga maritima, 

A; Reference number: A72200; MUID : 99287316; PMID : 10360571 
A; Access ion: A72258 
A; Status: preliminary 
A;Molecule type: DNA 
A; Residues: 1-458 <ARN> 

A; Cross-references: GB:AE001793; GB:AE000512; NID: g4981963; PIDN : AAD36479 . 1 ; 

PID:g4981972; TIGR:TM1408 

A; Experimental source: strain MSB8 

C; Genetics : 

A; Gene: TM14 08 

Query Match 11.1%; Score 78.5; DB 2; Length 458; 

Best Local Similarity 26.6%; Pred. No. 3.8; 

Matches 42; Conservative 17; Mismatches 34; Indels 65; Gaps 9; 

Qy 14 NFAVY SVSVGMHNLLLLEGRSWQEMDGQKKH WKD 4 7 

II: I : : I I I I : I I II 

Db 196 NFLILSYLRSSIRIGFDFLLL TRKHPQLLFIGYFYYLSIWIDNFIAWKV 244 

Qy 48 KWDLL YWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFR 99 

I ::: I I I I I I II I : I I I |: I 1=1 

Db 245 KGIEIAPGFFMSPEY--DIPK FMASLFFIPSLWFN LSMETVFQR 287 

Qy 100 lYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSN 137 

I I I : : I : I I I : II I : : I I : M 
Db 288 NYKGLMQSIV-SDK — PMRVISENLKKLSLSLRHAFSN 322 



RESULT 11 
F64924 

probable thiosulf ate-dithiol sulfurtransf erase (EC 2.8,1.5) - Escherichia coli 
(strain K-12) 

C; Species: Escherichia coli 

C;Date: 12-Sep-1997 #sequence_revision 17-Sep-1997 #text_change 03-Jun-2002 
C; Accession: F64 924 

R;Blattner, F.R.; Plunkett III, G. ; Bloch, C.A.; Perna, N.T.; Burland, V. ; 
Riley, M. ; Collado-Vides , J.; Glasner, J.D.; Rode, C.K.; Mayhew, G.F.; Gregor, 
J.; Davis, N.W.; Kirkpatrick, H.A. ; Goeden, M.A. ; Rose, D.J.; Mau, B. ; Shao, Y 
Science 277, 1453-1462, 1997 

A; Title: The complete genome sequence of Escherichia coli K-12. 
A; Reference number: A64720; MUID: 97426617; PMID: 9278503 
A;Accession: F64924 

A; Status: nucleic acid sequence not shown; translation not shown 

A; Molecule type: DNA 

A; Residues: 1-261 <BLAT> 

A;Cross-references: GB:AE000262; GB:U00096; NID: gl787955 ; PIDN : AAC74740 . 1 ; 
PID:gl787959; UWGP:bl670 

A; Experimental source: strain K-12, substrain MG1655 
C; Superf amily : hyaC protein 

C; Keywords: sulfurtransf erase; transmembrane protein 
F;30-46/Domain: transmembrane #status predicted <TM01> 
F; 83-99/Domain: transmembrane #status predicted <TM02> 
F; 112-128/Domain: transmembrane #status predicted <TM03> 
F; 187-203/Domain: transmembrane #status predicted <TM04> 
F; 224-240/Domain : transmembrane #status predicted <TM05> 

Query Match 11.0%; Score 77,5; DB 2; Length 261; 

Best Local Similarity 27.6%; Pred, No, 2,6; 

Matches 29; Conservative 16; Mismatches 43; Indels 17; Gaps 4 

Qy 22 VGMHNLLLLEGRSWQEMD-GQKKHWKDKWDLLYWRDIKKTGWFGASLFLLL SLT 76 

: I : I I I I I I : I : I I I : I : I I I : I I : 

Db 44 LGLHALLRARGVKKSATDHGEKIYLYSKAVRLWHWSN ALLFVLLLASGLIN 94 

Qy 77 VFSIVSVTAYIT^iLALLSVTISFRI YKGVIQAIAKSDEGHPFR 118 

I : : I II : I : I : : I : I I II : t 

Db 95 HFAMVGATAVKSLVAVHEVCGFLLLACWLGFVLINAVGDNGHHYR 139 



RESULT 12 
T49359 

hypothetical protein BlDl.130 [imported] - Neurospora crassa 
C; Species: Neurospora crassa 

C;Date: 02-Jun-2000 #sequence_revision 02-Jun-2000 #text_change 02-Jun-2000 
C; Accession: T4 9359 

R;Schulte, U. ; Aign, V.; Hoheisel, J.; Brandt, P.; Fartmann, B, ; Holland, R, ; 

Nyakatura, G. ; Mewes, H.W.; Mannhaupt, G, 

submitted to the Protein Sequence Database, May 2000 

A; Reference number: Z25022 

A;Accession: T49359 

A; Status: preliminary 

A; Molecule type: DNA 

A; Residues: 1-583 <SCH> 

A;Cross-references : EMBL : AL355927 ; GSPDB : GN00116; NCSP : BlDl . 130 



A; Experimental source: BAG clone BlDl; strain OR74A 

C; Genetics : 

A; Gene: NCSP : BlDl . 130 

A; Map position: 6 

A;Introns: 44/1 

Query Match 11.0%; Score 77.5; DB 2; Length 583; 

Best Local Similarity 27.5%; Pred. No. 6.2; 

Matches 28; Conservative 13; Mismatches 28; Indels 33; Gaps 4; 

Qy 52 LLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIAKS 111 

III : : : : I I I I I I I I : I 11 : I I : I I : 
Db 67 LCYW-PLERRLLVLHALLLLLLSLEHYS AYTRVLLLHITSSLNL 109 

Qy 112 DEGHPFRAYLESEVAIS EELVQKYSNSALG 141 

I I : : I I : : I I I : I I I 

Db 110 PLRVLVDDEVRVAKAIAWMAKDINPEELIQKRIEECAG 147 



RESULT 13 
C95920 

hypothetical membrane protein [imported] - Sinorhizobium meliloti (strain 1021) 
magaplasmid pSymB 

C; Species: Sinorhizobium meliloti 

C;Date: 24-Aug-2001 #sequence_revision 24-Aug-2001 #text_change 30-Sep-2001 
C;Accession: C95920 

R;Finan, T.M. ; Weidner, S.; Wong, K. ; Buhrmester, J.; Chain, P.; Vorholter, 
F.J.; Hernandez-Lucas, I.; Becker, A.; Cowie, A.; Gouzy, J.; Golding, B. ; 
Puhler, A. 

Proc. Natl. Acad. Sci. U.S.A. 98, 9889-9894, 2001 

A; Title: The complete sequence of the 1,683-kb pSymB megaplasmid from the N2- 

fixing endosymbiont Sinorhizobium meliloti. 

A; Reference number: A95842; MUID : 21396508 ; PMID : 11481431 

A;Accession: C95920 

A; Status: preliminary 

A; Molecule type: DNA 

A; Residues: 1-481 <KUR> 

A; Cross-references: GB:AL591985; PIDN : CAC4 9027 . 1 ; PID: gl5140512 ; GSPDB: GN00167 
A; Experimental source: strain 1021, megaplasmid pSymB 

R;Galibert, F.; Finan, T.M.; Long, S.R.; Puhler, A.; Abola, P.; Ampe, F. ; 
Barloy-Hubler, F.; Barnett, M.J.; Becker, A.; Boistard, P.; Bothe, G. ; Boutry, 
M. ; Bowser, L. ; Buhrmester, J.; Cadieu, E. ; Capela, D.; Chain, P.; Cowie, A.; 
Davis, R.W.; Dreano, S.; Federspiel, N.A.; Fisher, R.F.; Gloux, S.; Godrie, T. 
Goffeau, A.; Golding, B.; Gouzy, J.; Gurjal, M. ; Hernandez-Lucas, I.; Hong, A. 
Huizar, L.; Hyman, R.W. ; Jones, T. 
Science 293, 668-672, 2001 

A;Authors: Kahn, D.; Kahn, M.L.; Kalman, S.; Keating, D.H.; Kiss, E. ; Komp, C. 
Lelaure, V,; Masuy, D.; Palm, C. ; Peck, M.C.; Pohl, T.M.; Portetelle, D. ; 
Purnelle, B.; Ramsperger, U.; Surzycki, R. ; Thebault, P.; Vandenbol, M.; 
Vorholter, F.J.; Weidner, S.; Wells, D.H.; Wong, K. ; Yeh, K.C.; Batut, J. 
A; Title: The composite genome of the legume symbiont Sinorhizobium meliloti. 
A; Reference number: A96039; MUID : 21368234 ; PMID : 11474 104 
A; Contents: annotation 
C; Genetics : 
A;Gene: SMb21048 
A; Genome: plasmid 



Query Match 10.9%; Score 76.5; DB 2; Length 481; 

Best Local Similarity 37.7%; Pred. No. 6.4; 

Matches 23; Conservative 8; Mismatches 27; Indels 3; Gaps 



2; 



Qy 53 LYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIAKSD 112 

: I I I I I I I I I I I : I I I M : I I : I I : :: I : I 

Db 238 VYWRKTKSR — VAQALAFLLLVLLLLSTSSV-AYVGLAVLSIPVALSISWSFLSGRMDKD 2 94 

Qy 113 E 113 

I 

Db 295 E 295 



RESULT 14 
F64665 

glucose-6-phosphate isomerase (EC 5.3.1.9) - Helicobacter pylori (strain 26695) 
C; Species: Helicobacter pylori 

C;Date: 09-Aug-1997 #sequence_revision 09-Aug-1997 #text_change 16-Jul-1999 
C;Accession: F64665 

R;Tomb, J. F.; White, O.; Kerlavage, A.R.; Clayton, R.A. ; Sutton, G.G.; 
Fleischmann, R.D.; Ketchum, K.A. ; Klenk, H.P.; Gill, S.; Dougherty, B.A.; 
Nelson, K. ; Quackenbush, J.; Zhou, L.; Kirkness, E.F.; Peterson, S.; Loftus, B. 
Richardson, D.; Dodson, R. ; Khalak, E.G.; Glodek, A.; McKenney, K. ; Fitzegerald 
L.M.; Lee, N. ; Adams, M.D.; Hickey, E.K.; Berg, D.E.; Gocayne, J.D.; Utterback, 
T.R.; Peterson, J.D.; Kelley, J.M. ; Cotton, M.D.; Weidman, J.M. ; Fujii, C; 
Bowman, C; Watthey, L. 
Nature 388, 539-547, 1997 

A;Authors: Wallin, E.; Hayes, W.S.; Borodovsky, M. ; Karpk, P.D.; Smith, H.O.; 
Eraser, CM.; Venter, J.C. 

A;Title: The complete genome sequence of the gastric pathogen Helicobacter 
pylori . 

A; Reference number: A64520; MUID : 97394467 ; PMID: 9252185 
A;Accession: F64665 

A; Status: preliminary; nucleic acid sequence not shown; translation not shown 
A;Molecule type: DNA 
A; Residues: 1-545 <TOM> 

A;Cross-references: GB:AE000622; GB:AE000511; NID : g2314317 ; PIDN: AAD08211 . 1; 
PID:g2314323; TIGR:HP1166 

C; Super family : glucose-6-phosphate isomerase 
C;Keywords: intramolecular oxidoreductase; isomerase 

Query Match 10.7%; Score 75.5; DB 2; Length 545; 

Best Local Similarity 25.3%; Pred. No. 9.2; 

Matches 37; Conservative 20; Mismatches 44; Indels 45; Gaps 6; 
Qy 23 GMHNLL LLEGRSWQEMDGQ KKHWKDKWDLLYWRDIKKTGWFGASL 69 



Db 411 GHHEILFSNVLAQAQAFMKGKSYEEALGELLFKGLDKDEAKDLAHHR VFFGNRP 4 64 

Qy 70 FLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIAKSD 112 

: I I I :: I : I I I : : 1 I I I I 

Db 4 65 SNILLLEKISPSNIGALVALYEHKVFV QGVIWDINSFDQWGVELGKELAVPILQE 519 

Qy 113 -EGHPFRAYLESEVAISEELVQKYSN 137 

III I I : I : : I : : I I 

Db 520 LEGHKSNAYFDSS TKHLIELYKN 542 



RESULT 15 
E71851 

glucose-6-phosphate isomerase - Helicobacter pylori (strain J99) 
C; Species: Helicobacter pylori 
A;Variety: strain J99 

C;Date: 12-Feb-1999 #sequence_revision 12-Feb-1999 #text_change 16-Jul-1999 
C; Access ion : E71851 

R;Alm, R.A. ; Ling, L.S.L.; Moir, D.T.; King, B.L.; Brown, E.D.; Doig, P.C.; 
Smith, D.R.; Noonan, B. ; Guild, B.C.; deJonge, B.L.; Carmel, G. ; Turnmino, P.J. 
Caruso, A.; Uria-Nickelsen, M. ; Mills, D.M.; Ives, C. ; Gibson, R. ; Merberg, D. 
Mills, S.D.; Jiang, Q.; Taylor, D.E.; Vovis, G.F.; Trust, T.J. 
Nature 397, 176-180, 1999 

A; Title: Genomic sequence comparison of two unrelated isolates of the human 
gastric pathogen Helicobacter pylori. 

A; Reference number: A71800; MUID : 9912 0557 ; PMID: 9923682 
A/Accession: E71851 
A; Status: preliminary 
A;Molecule type: DNA 
A; Residues: 1-545 <ARN> 

A;Cross-references: GB:AE001536; GB:AE001439; NID: g4 155675 ; PIDN :7UyD06664 . 1 ; 
PID:g4155679 

A; Experimental source: strain J99 
C; Genetics : 
A; Gene : pgi 

C; Superf amily : glucose-6-phosphate isomerase 

Query Match 10.6%; Score 74.5; DB 2; Length 545; 

Best Local Similarity 25.3%; Pred. No. 12; 

Matches 37; Conservative 19; Mismatches 45; Indels 45; Gaps 6 

Qy 23 GMHNLL LLEGRSWQEMDGQ KKHWKDKWDLLYWRDIKKTGWFGASL 69 

I I : I : : I : I : : I I : | I I : I I : I III 
Db 411 GHHEILFSNVLAQAQAFMKGKSYEEALGELLSKGLDKDEAKDLAHHR VFFGNRP 464 

Qy 70 FLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIAKSD 112 

: I I I : : I : 1 I I : : I M I I 

Db 465 SNILLLEKISPSNIGALVALYEHKVFV QGVIWDINSFDQWGVELGKELAVPILQE 519 

Qy 113 -EGHPFRAYLESEVAISEELVQKYSN 137 

III I I : I : I : : I I 

Db 520 LEGHKSNAYFDSS TRHLIELYKN 542 



Search completed: September 29, 2004, 18:16:07 
Job time : 5.45105 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: September 29, 2004, 18:16:15 ; Search time 15.1353 Seconds 

(without alignments) 
2997.869 Million cell updates/se 



Title: US-09-8 30- 972-32 

Perfect score: 705 

Sequence: 1 QASGEAGVSCLRENFAVYSV ESEVAISEELVQKYSNSALG 141 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1351062 seqs, 321799191 residues 

Total number of hits satisfying chosen parameters: 1351062 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_AA: * 

1 : /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB . pep : * 

2 : / cgn2_6/ptodata/ 2 /pubpaa/ PCT_NEW_PUB . pep : * 

3: /cgn2_6/ptodata/2/pubpaa/US0 6_NEW_PUB.pep:* 

4: /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep:* 

5: /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep:* 

6 : / cgn2_6/ptodata/2 /pubpaa/ PCTUS_PUBCOMB . pep : * 

7: /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep:* 

8 : / cgn2_6/ptodata/2 /pubpaa/US 0 8_PUBC0MB . pep : * 

9: /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep: * 

10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep:* 

11 : /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB . pep : * 

12 : /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep: * 

13: /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep:* 

14 : /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep:-^ 

15: /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB,pep:* 

16: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep: * 

17 : /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep: * 

18 : /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep: 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 



No. Score Match Length DB ID 



Description 



1 


510 


72. 


3 


199 


9 


US-09-893-348-21 


Sequence 21, Appl 


2 


503 


71. 


3 


118 


15 


US-10-264-237-1568 


Sequence 1568, Ap 


3 


503 


71. 


3 


199 


9 


US-09-893-348-25 


Sequence 25, Appl 


4 


503 


71. 


3 


199 


12 


US-10-660-946-1 


Sequence 1, Appli 


5 


503 


71. 


3 


199 


12 


US-09-97 8-3 60A-467 


Sequence 467, App 


6 


503 


71. 


3 


199 


16 


US-10-466-258-11 


Sequence 11, Appl 


7 


496 


70. 


4 


199 


12 


* US-10-408-967-9 


Sequence 9, Appli 


8 


447 


63. 


4 


1163 


9 


US-09-893-348-18 


Sequence 18, Appl 


9 


447 


63. 


4 


1192 


9 


US-09-789-386-2 


Sequence 2, Appli 


10 


447 


63. 


4 


1192 


9 


US-09-758-140-6 


Sequence 6, Appli 


11 


447 


63. 


4 


1192 


9 


US-09-893-348-23 


Sequence 23, Appl 


12 


447 


63, 


4 


1192 


9 


US-09-972-599A-6 


Sequence 6, Appli 


13 


447 


63. 


4 


1192 


12 


US-10-267-502-429 


Sequence 42 9, App 


14 


447 


63. 


4 


1192 


14 


US-10-060-036-71 


Sequence 71, Appl 


15 


447 


63. 


4 


1192 


16 


US-10-327-213-9 


Sequence 9, Appli 


16 


447 


63. 


4 


1192 


16 


US-10-466-258-9 


Sequence 9, Appli 


17 


443 


62 . 


8 


360 


9 


US-09-893-348-20 


Sequence 20, Appl 


18 


443 


62 . 


8 


373 


9 


US-09-789-386-6 


Sequence 6, Appli 


19 


443 


62. 


8 


373 


9 


US-09-765-205-6 


Sequence 6, Appli 


20 


443 


62. 


8 


373 


9 


US-09-893-348-24 


Sequence 24, Appl 


21 


443 


62 . 


8 


373 


12 


US-10-408-967-8 


Sequence 8, Appli 


22 


443 


62. 


8 


373 


14 


US-10-060-036-72 


Sequence 72, Appl 


23 


443 


62. 


8 


373 


16 


US-10-466-258-4 


Sequence 4, Appli 


24 


443 


62 . 


8 


379 


14 


US-10-205-194-164 


Sequence 164, App 


25 


440 


62 , 


,4 


1192 


12 


US-10-408-967-7 


Sequence 7, Appli 


26 


436.5 


61. 


, 9 


1163 


12 


US-10-267-502-431 


Sequence 431, App 


27 


349 


49. 


,5 


208 


12 


US-10-660-946-7 


Sequence 7, Appli 


28 


348 


49. 


,4 


2 67 


12 


US-10-660-946-8 


Sequence 8, Appli 


29 


348 


49. 


,4 


267 


14 


US-10-205-194-127 


Sequence 127, App 


30 


337 


47. 


.8 


356 


12 


US-10-660-946-6 


Sequence 6, Appli 


31 


337 


47. 


. 8 


776 


12 


US-10-660-946-5 


Sequence 5, Appli 


32 


337 


47. 


. 8 


776 


12 


US-10-267-502-430 


Sequence 430, App 


33 


337 


47. 


.8 


111 


14 


US-10-205-219-93 


Sequence 93, Appl 


34 


337 


47. 


.8 


780 


12 


US-10-267-502-432 


Sequence 432, App 


35 


309 


43, 


.8 


236 


9 


US-09-729-674-20 


Sequence 20, Appl 


36 


309 


43, 


. 8 


236 


9 


US-09-765-205-26 


Sequence 26, Appl 


37 


309 


43, 


. 8 


236 


12 


US-10-408-967-2 


Sequence 2, Appli 


38 


309 


43, 


.8 


266 


12 


US-10-276-774-2330 


Sequence 2330, Ap 


39 


309 


43, 


.8 


269 


14 


US-10-106-698-6222 


Sequence 6222, Ap 


40 


305 


43, 


.3 


241 


12 


US-10-660-946-3 


Sequence 3, Appli 


41 


304 


43 


.1 


593 


15 


US-10-108-260A-2892 


Sequence 2892, Ap 


42 


269 


38 


.2 


161 


9 


US-09-925-302-808 


Sequence 808, App 


43 


269 


38 


.2 


161 


12 


US-09-925-302-808 


Sequence 808, App 


44 


259.5 


36 


. 8 


204 


12 


US-10-424-599-194160 


Sequence 194160, 


45 


256 


36 


.3 


234 


12 


US- 10-42 4-5 99-2008 40 


Sequence 200840, 



ALIGNMENTS 



RESULT 1 

US-09-893-348-21 

; Sequence 21, Application US/09893348 
; Patent No. US20020072493A1 
; GENERAL INFORMATION: 



APPLICANT: EISENBACH-SCHWARTZ , Michal 
APPLICANT: COHEN, Iran R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ=2A 
CURRENT APPLICATION NUMBER: US/ 09/ 8 93 , 34 8 
CURRENT FILING DATE: 2001-06-2 8 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/ 14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 29 
SOFTWARE: Patentin version 3.1 
SEQ ID NO 21 
LENGTH: 199 
TYPE: PRT 

ORGANISM: Rattus norvegicus 
US-09-893-348-21 

Query Match 72.3%; Score 510; DB 9; Length 199; 

Best Local Similarity 99,0%; Pred. No. 5.9e-51; 

Matches 103; Conservative 0; Mismatches 1; Indels 0; Gaps 0 

Qy 38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 

I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I 
Db 1 MDGQKKHWKDKVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 

Qy 98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNS7VLG 141 

I I I I I I M I I I I I I I I M I I I I I I M I I I I I I I I I I I I M I I I 
Db 61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



RESULT 2 

US-10-264-237-1568 

; Sequence 1568, Application US/10264237 

; Publication No. US20040009491A1 

; GENERAL INFORMATION: 

; APPLICANT: Birse et al . 

; TITLE OF INVENTION: Nucleic Acids, Proteins, and Antibodies 
; FILE REFERENCE: PA131P1 

; CURRENT APPLICATION NUMBER: US/10/264 , 2 37 

CURRENT FILING DATE: 2002-10-04 
; PRIOR APPLICATION NUMBER: PCT/USOl/ 1 6450 
; PRIOR FILING DATE: 2001-05-18 
; PRIOR APPLICATION NUMBER: US 60/205,515 
; PRIOR FILING DATE: 2000-05-19 
; NUMBER OF SEQ ID NOS: 2876 
; SOFTWARE: Patentin Ver. 3.1 
; SEQ ID NO 1568 
LENGTH: 118 



TYPE: PRT 
; ORGANISM: Homo sapiens 
; FEATURE: 

; NAME/KEY: MISC_FEATURE 
LOCATION: (118) 

; OTHER INFORMATION: Xaa equals any of the twenty naturally occurring L-amino 
acids 

US-10-264-237-1568 

Query Match 71.3%; Score 503; DB 15; Length 118; 

Best Local Similarity 98.1%; Pred. No. 1.9e-50; 

102; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 

I I I I I I : I I I I M I I I I I I I I I I I I I I I I I I I I M I I I M M I I I I I I I I M I I I I I I I I 
1 MDGQKKNWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 

98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



Matches 

Qy 

Db 

Qy 

Db 



RESULT 3 

US-09-893-348-25 

Sequence 25, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: EISENBACH-SCHWARTZ , Michal 
APPLICANT: COHEN, Irun R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT : MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EI S- SCHWARTZ =2 A 
CURRENT APPLICATION NUMBER: US/09/893, 348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS: 29 
SOFTWARE: Patentin version 3.1 
SEQ ID NO 25 
LENGTH: 199 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-893-348-25 



Query Match 71.3%; Score 503; DB 9; Length 199; 

Best Local Similarity 98.1%; Pred. No. 3.9e-50; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps 



0; 



Qy 38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIAIALLSVTIS 97 

I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I 
Db 1 MDGQKKNWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 

Qy 98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



RESULT 4 
US-10-660-946-1 

; Sequence 1, Application US/10660946 
; Publication No. US20040063131A1 
; GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
; Au- Young, Janice 

; Goli, Surya K. 

; Hillman, Jennifer L . 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
; NUMBER OF SEQUENCES: 9 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 
; CITY: Palo Alto 

STATE: CA 

COUNTRY: U.S. 

ZIP: 94304 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Diskette 

; COMPUTER: IBM Compatible 

; OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/ 10/ 660 , 94 6 

FILING DATE: 12-Sep-2003 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: US/ 09/22 8 , 2 13A 

FILING DATE: <Unknown> 
; APPLICATION NUMBER: 08/700,607 

; FILING DATE: <Unknown> 

ATTORNEY/AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

; REGISTRATION NUMBER: 36,749 

; REFERENCE/DOCKET NUMBER: PF-0114 US 

TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
; INFORMATION FOR SEQ ID NO: 1: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 199 amino acids 

; TYPE: amino acid 

STRANDEDNESS: single 
; TOPOLOGY: linear 

MOLECULE TYPE: peptide 
; IMMEDIATE SOURCE: 

; LIBRARY: <Unknown> 

; CLONE: Consensus 



SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
US-10-660-946-1 



Query Match 71.3%; Score 503; DB 12; Length 199; 

Best Local Similarity 98.1%; Pred. No. 3.9e-50; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy 38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 

I I I I I I : I I I I I I I I I I I I I I M M I I I I I I I I I I M I I I I I I M I M I I I I I I I M I I I 
Db 1 MDGQKKNWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 

Qy 98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



RESULT 5 

US-0 9-97 8-360A-467 

Sequence 467, Application US/09978360A 
Publication No. US20040110939A1 
GENERAL INFORMATION: 
APPLICANT: Edwards, Jean-Baptiste Dumas Milne 
APPLICANT: Duclert, Aymeric 
APPLICANT: Bougueleret, Lydie 
APPLICANT: Jobert, Severin 
APPLICANT: Clusel, Catherine 

TITLE OF INVENTION: Complementary DNA's Encoding Proteins with Signal 
Peptides 

FILE REFERENCE: 56.US4.CIP 

CURRENT APPLICATION NUMBER: US/ 0 9/ 97 8 , 360A 
CURRENT FILING DATE: 2001-10-15 
PRIOR APPLICATION NUMBER: US 60/066,677 
PRIOR FILING DATE: 1997-11-13 
PRIOR APPLICATION NUMBER: US 60/069,957 
PRIOR FILING DATE: 1997-12-17 
PRIOR APPLICATION NUMBER: US 60/074,121 
PRIOR FILING DATE: 1998-02-09 
PRIOR APPLICATION NUMBER: US 60/081,563 
PRIOR FILING DATE: 1998-04-13 
PRIOR APPLICATION NUMBER: US 60/096,116 
PRIOR FILING DATE: 1998-08-10 
PRIOR APPLICATION NUMBER: US 60/099,273 
PRIOR FILING DATE: -09-04 
PRIOR APPLICATION NUMBER: US 09/191,997 
PRIOR FILING DATE: 1998-11-13 
PRIOR APPLICATION NUMBER: US 09/215,435 
PRIOR FILING DATE: 1998-12-17 
PRIOR APPLICATION NUMBER: PCT/IB98/02122 
PRIOR FILING DATE: 1998-12-17 
PRIOR APPLICATION NUMBER: US 09/247,155 
PRIOR FILING DATE: 1999-02-09 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS : 810 
SOFTWARE: Patent. pm 
SEQ ID NO 467 
LENGTH: 199 
TYPE: PRT 



ORGANISM: Homo sapiens 
FEATURE : 

NAME/ KEY: SIGNAL 
LOCATION: -42.. -1 
US-09-978-360A-467 



Query Match 71.3%; 
Best Local Similarity 98.1%; 
Matches 102; Conservative 



Score 503; DB 12; 
Pred. No. 3.9e-50; 
1; Mismatches 1; 



Length 199; 



Indels 



0; Gaps 



0; 



Qy 

Db 



38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 

I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I 
1 MDGQKKNWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 



Qy 

Db 



98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M M I I I I I I I I I 
61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



RESULT 6 

US-10-466-258-11 

Sequence 11, Application US/10466258 
Publication No. US20040132096A1 
GENERAL INFORMATION: 
APPLICANT: GLAXO GROUP LIMITED 
TITLE OF INVENTION: ASSAY 
FILE REFERENCE: P80966 GCW 

CURRENT APPLICATION NUMBER: US/ 10/4 66, 258 
CURRENT FILING DATE: 2003-07-15 
NUMBER OF SEQ ID NOS: 13 
SOFTWARE: Patent In version 3.0 
SEQ ID NO 11 
LENGTH: 199 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-466-258-11 

Query Match 71.3%; Score 503; DB 16; Length 199; 

Best Local Similarity 98.1%; Pred. No. 3.9e-50; 

Matches 102; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy 38 MDGQKKHWKDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 

I I I I I I : I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I 
Db 1 MDGQKKNWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 

Qy 98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



RESULT 7 
US-10-408-967-9 

; Sequence 9, Application US/10408967 
; Publication No. US20040063161A1 
; GENERAL INFORMATION: 

; APPLICANT: Pharmacia & Upjohn Company 
; APPLICANT: Yan, Riqiang 



APPLICANT: Lu, Yifeng 

TITLE OF INVENTION: Compositions and Methods of Treating Alzheimer *s Diseas 
FILE REFERENCE: 00925 

CURRENT APPLICATION NUMBER: US/10/408, 967 
CURRENT FILING DATE: 2003-04-08 
NUMBER OF SEQ ID NOS : 9 
SOFTWARE: Patentin version 3.1 
SEQ ID NO 9 
LENGTH: 199 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-408-967-9 



Query Match 70.4%; Score 496; DB 12; Length 199; 

Best Local Similarity 97.1%; Pred. No. 2.5e-49; 

Matches 101; Conservative 1; Mismatches 2; Indels 0; 



Gaps 



Qy 

Db 



38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 

I I I I I I : I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I 
1 MDGQKKNWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 



Qy 

Db 



98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
61 FRIYKGVIQAIQKSDEGHPFPAYLESEVAISEELVQKYSNSALG 104 



RESULT 8 

US-09-893-348-18 

Sequence 18, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: EISENBACH-SCHWARTZ , Michal 
APPLICANT: COHEN, Irun R. 
APPLICANT: BESERMAt^, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ=2A 
CURRENT APPLICATION NUMBER: US/09/893, 348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS: 29 
SOFTWARE: Patentin version 3.1 
SEQ ID NO 18 
LENGTH: 1163 
TYPE: PRT 

ORGANISM: Rattus norvegicus 
US-09-893-348-18 



Query Match 63.4%; Score 447; DB 9; Length 1163; 

Best Local Similarity 96.9%; Pred. No. 1.3e-42; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 4 6 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M 
Db 973 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1032 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III I I I I I II I II II I I I I I I I I I I I I I I I II I I I 
Db 1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 1068 



RESULT 9 
US-09-789-386-2 

Sequence 2, Application US/09789386 
Patent No. US2 0020010324A1 
GENERAL INFORMATION: 
APPLICANT: MICHALOVICH, DAVID 
APPLICANT: PRINJHA, RABINDER KUMAR 
TITLE OF INVENTION: NOVEL COMPOUNDS 
FILE REFERENCE: GP-30165-Cl 

CURRENT APPLICATION NUMBER: US/09/789, 386 
CURRENT FILING DATE: 2001-02-21 
PRIOR APPLICATION NUMBER: U.K. 9916898.1 
PRIOR FILING DATE: 1999-07-19 
PRIOR APPLICATION NUMBER: U.K. 9816024.5 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: US 09/359,208 
PRIOR FILING DATE: 1999-07-22 
NUMBER OF SEQ ID NOS: 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 2 
LENGTH: 1192 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-789-386-2 

Query Match 63.4%; Score 447; DB 9; Length 1192; 

Best Local Similarity 96.9%; Pred. No. 1.4e-42; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 4 6 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I I I I M I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I II I I I I I I I I I I I II II II 
Db 1002 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1061 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSTUjG 141 

III I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I 
Db 1062 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 1097 



RESULT 10 
US-09-758-140-6 

; Sequence 6, Application US/09758140 
; Patent No. US20020012965A1 
; GENERAL INFORMATION: 

; APPLICANT: Strittmatter , Stephen M. 



; TITLE OF INVENTION: No. US20020012965Alo Receptor-Mediated Blockade of 
Axonal Growth 

; FILE REFERENCE: 44574-5073-US 

; CURRENT APPLICATION NUMBER: US/09/758 , 140 

; CURRENT FILING DATE: 2001-01-12 

; PRIOR APPLICATION NUMBER: US 60/175,707 

; PRIOR FILING DATE: 2000-01-12 

; PRIOR APPLICATION NUMBER: US 60/207,366 

; PRIOR FILING DATE: 2000-05-26 

; PRIOR APPLICATION NUMBER: US 60/236,378 

; PRIOR FILING DATE: 2000-09-29 

; NUMBER OF SEQ ID NOS : 20 

; SOFTWARE: PatentlnVer. 2.1 

; SEQ ID NO 6 

; LENGTH: 1192 

; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-758-140-6 

Query Match 63.4%; Score 447; DB 9; Length 1192; 

Best Local Similarity 96.9%; Pred. No. 1.4e-42; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 4 6 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1002 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIAIALLSVTISFRIYKGVI 1061 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I 
Db 1062 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 1097 



RESULT 11 
US-09-893-348-23 

Sequence 23, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: EISENBACH-SCHWARTZ , Michal 
APPLICANT: COHEN, Irun R. 
APPLICANT : BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC 7\NTIGENS AND 
THEIR USES 

FILE REFERENCE: EI S- SCHWARTZ =2 A 
CURRENT APPLICATION NUMBER: US/ 09/893 , 348 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/14715 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS: 29 
SOFTWARE: Patent In version 3.1 



; SEQ ID NO 23 
; LENGTH: 1192 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-893-348-23 

Query Match 63.4%; Score 447; DB 9; Length 1192; 

Best Local Similarity 96.9%; Fred. No. 1.4e-42; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 4 6 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
Db 1002 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1061 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III II I I I I I I I I I I I I II I I I M I I I I II I I I I I 
Db 1062 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 1097 



RESULT 12 
US-09-972-599A-6 

; Sequence 6, Application US/09972599A 
; Patent No. US2 0020077295A1 
; GENERAL INFORMATION: 

; APPLICANT: STRITTMATTER, STEPHEN M. 

; TITLE OF INVENTION: NOGO-RECEPTOR-MEDIATED BLOCKME OF AXONAL GROWTH 
FILE REFERENCE: C077 CIP US 

CURRENT APPLICATION NUMBER: US/ 09/ 972 , 599A 

CURRENT FILING DATE: 2001-10-06 
; PRIOR APPLICATION NUMBER: PCT/USOl/01041 
; PRIOR FILING DATE: 2001-01-12 
; PRIOR APPLICATION NUMBER: 09/758,140 

PRIOR FILING DATE: 2001-01-12 
; PRIOR APPLICATION NUMBER: 60/236,378 
; PRIOR FILING DATE: 2000-09-29 
; PRIOR APPLICATION NUMBER: 60/207,366 
; PRIOR FILING DATE: 2000-05-26 
; PRIOR APPLICATION NUMBER: 60/175,707 
; PRIOR FILING DATE: 2000-01-12 
; NUMBER OF SEQ ID NOS : 57 
; SOFTWARE: Patentin Ver . 2.1 
; SEQ ID NO 6 

LENGTH: 1192 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-972-599A-6 

Query Match 63.4%; Score 447; DB 9; Length 1192; 

Best Local Similarity 96.9%; Pred. No. 1.4e-42; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 46 KDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I II I II I II I I I I I I I I I I I I I I I I I I 
Db 1002 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1061 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III I I I I I I M I I I I I I I I II I I I I I I II I M I I I 



Db 1062 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 1097 



RESULT 13 
US-10-267-502-429 

; Sequence 429, Application US/10267502 

; Publication No. US20040071700A1 

; GENERAL INFORMATION: 

; APPLICANT: Kim, Jaeseob 

; APPLICANT: Galant, Ron 

; TITLE OF INVENTION: Obesity Linked Genes 

FILE REFERENCE: LSD-07416 
; CURRENT APPLICATION NUMBER: US/ 10/2 67 , 5 02 
; CURRENT FILING DATE: 2003-01-27 
; NUMBER OF SEQ ID NOS : 439 
; SOFTWARE: Patentin version 3.2 
; SEQ ID NO 429 
; LENGTH: 1192 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-10-267-502-429 

Query Match 63,4%; Score 447; DB 12; Length 1192; 

Best Local Similarity 96.9%; Pred. No. 1.4e-42; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 4 6 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIAIALLSVTISFRIYKGVI 105 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 1002 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1061 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III I I I II I I I I I I I II I I I I I I I I I II I M I I I I 
Db 1062 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 1097 



RESULT 14 
US-10-060-036-71 

Sequence 71, Application US/10060036 
Publication No. US20030073144A1 
GENERAL INFORMATION: 
APPLICANT: Benson, Darin R, 
APPLICANT: Kalos, Michael D. 
APPLICANT: Lodes, Michael J, 
APPLICANT: Persing, David H. 
APPLICANT: Hepler, William T. 
APPLICANT: Jiang, Yuqiu 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
FILE REFERENCE: 210121.566 

CURRENT APPLICATION NUMBER: US/ 10/ 060 , 036 
CURRENT FILING DATE: 2002-01-30 
NUMBER OF SEQ ID NOS: 4560 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 71 
LENGTH: 1192 
TYPE: PRT 

ORGANISM: Homo sapiens 



US-10-060-036-71 

Query Match 63.4%; Score 447; DB 14; Length 1192; 

Best Local Similarity 96.9%; Pred. No. l,4e-42; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 46 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
Db 1002 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1061 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III I I I II I I I I I I I I I M I I I I II I I I I I I I I I I 
Db 1062 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 1097 



RESULT 15 
US-10-327-213-9 

; Sequence 9, Application US/10327213 

; Publication No, US20040121341A1 

; GENERAL INFORMATION: 

; APPLICANT: FILBIN, MARIE T, 

; APPLICANT: DOMENICONI, MARCO 

; APPLICANT: CAO, ZIXUAN 

; TITLE OF INVENTION: INHIBITORS OF MYELIN-ASSOCIATED GLYCOPROTEIN (MAG) 

; TITLE OF INVENTION: ACTIVITY FOR REGULATING NEURAL GROWTH AND REGENERATION 

; FILE REFERENCE: CUNY/003 

; CURRENT APPLICATION NUMBER: US/ 10/327 , 2 13 

; CURRENT FILING DATE: 2002-12-20 

; NUMBER OF SEQ ID NOS : 43 

; SOFTWARE: PatentlnVer. 2.1 

; SEQ ID NO 9 

LENGTH: 1192 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-327-213-9 

Query Match 63.4%; Score 447; DB 16; Length 1192; 

Best Local Similarity 96.9%; Pred. No. 1.4e-42; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 46 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I II I I II I I I I I I M I I I II I I I I I I I I I II I I I I I II I I I I I II I I I I I I I I I I I I I 
Db 1002 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1061 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I 
Db 1062 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 1097 



Search completed: September 29, 2004, 18:48:20 
Job time : 16.1353 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



Septeinber 29, 2004, 18:06:43 ; Search time 11.0927 Seconds 

(without alignments) 
4010.587 Million cell updates/sec 

US-09-830-972-32 
705 

1 QASGEAGVSCLRENFAVYSV ESEVAISEELVQKYSNSALG 141 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1017041 seqs, 315518202 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1017041 



Database : SPTREMBL_25 : * 

sp_archea : * 
sp_bacteria : * 
sp_f ungi : * 
sp_human : * 
sp_invertebrate: * 
sp_mammal : * 
sp_mhc : * 
sp_organelle : * 
9: sp_phage:* 
10: sp_plant:* 
11: sp_rodent:* 
12: sp_virus:* 
13: sp vertebrate:* 
14: sp_unclassif led: * 
15: sp_rvirus : * 
16: sp_bacteriap: * 
17: sp_archeap:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution, 

SUMMARIES 



Result Query 

No. Score Match Length DB ID 



Description 



1 


460 


65. 


.2 


199 


13 


Q7T224 


Q7t224 gallus gall 


2 


447 


63. 


. 4 


578 


11 


Q80W95 


Q80w95 mus musculu 


3 


447 


63. 


.4 


639 


11 


Q8K290 


Q8k290 mus musculu 


4 


447 


63. 


.4 


986 


4 


Q8IUA4 


Q8iua4 homo sapien 


5 


447 


63. 


.4 


1046 


11 


Q8BGK7 


Q8bgk7 mus musculu 


6 


447 


63. 


.4 


1162 


11 


Q8BGM9 


Q8bgm9 mus musculu 


7 


443 


62. 


.8 


356 


11 


Q8BH78 


Q8bh7 8 mus musculu 


8 


443 


62, 


.8 


375 


11 


Q8BHF5 


Q8bhf5 mus musculu 


9 


443 


62. 


, 8 


392 


4 


Q96B16 


Q96bl6 homo sapien 


10 


436.5 


61. 


.9 


1163 


11 


Q8K3G8 


Q8k3g8 mus musculu 


11 


432.5 


61, 


.3 


357 


11 


Q8K3G7 


Q8k3g7 mus musculu 


12 


417 


59. 


, 1 


184 


6 


Q7YRW9 


Q7yrw9 bos taurus 


13 


386 


54 . 


, 8 


179 


6 


Q9GM33 


Q9gm33 macaca fasc 


14 


384 


54 . 


, 5 


214 


13 


Q7T222 


Q7t222 carassius a 


15 


348 


49, 


.4 


199 


4 


Q9BQ59 


Q9bq5 9 homo sapien 


16 


348 


49. 


.4 


267 


11 


Q63765 


Q63765 rattus sp. 


17 


337 


47 , 


. 8 


780 


11 


Q8K4S4 


Q8k4s4 mus musculu 


18 


337 


47 . 


. 8 


780 


11 


Q8K0T0 


Q8k0t0 mus musculu 


19 


325 


46. 


, 1 


208 


13 


Q90637 


Q90637 gallus gall 


20 


320 


45. 


, 4 


760 


13 


Q90638 


Q90638 gallus gall 


21 


308 


43, 


.7 


236 


11 


Q8VBU0 


QSvbuO rattus norv 


22 


308 


43, 


, 7 


237 


11 


Q8C6D5 


Q8c6d5 mus musculu 


23 


308 


43. 


.7 


643 


11 


Q8CCU2 


Q8ccu2 mus musculu 


24 


283.5 


40. 


.2 


221 


13 


Q7ZUD6 


Q7zud6 brachydanio 


25 


255 


36. 


,2 


234 


5 


Q9VMW3 


Q9vmw3 drosophila 


26 


253 


35. 


. 9 


224 


5 


Q9VMW1 


Q9vmwl drosophila 


27 


252 


35. 


. 7 


222 


5 


Q9VMW4 


Q9vmw4 drosophila 


28 


252 


35. 


, 7 


595 


5 


Q9VMV9 


Q9vmv9 drosophila 


29 


250 


35, 


. 5 


202 


5 


Q9VMW2 


Q9vmw2 drosophila 


30 


162 


23. 


. 0 


2484 


5 


Q9U347 


Q9u347 caenorhabdi 


31 


159 


22 , 


, 6 


2607 


5 


Q23187 


Q2 3187 caenorhabdi 


32 


158 


22 , 


. 4 


222 


5 


Q23188 


Q2318 8 caenorhabdi 


33 


104 


14, 


, 8 


107 


13 


Q7T223 


Q7t223 carassius a 


34 


102 . 5 


14 , 


. 5 


154 


5 


Q9VIB7 


Q9vib7 drosophila 


35 


102 . 5 


14 . 


. 5 


158 


5 


Q24199 


Q24199 drosophila 


36 


85 


12 . 


, 1 


457 


10 


Q8LDS3 


Q81ds3 arabidopsis 


37 


83 . 5 


11 , 


. 8 


295 


3 


Q04947 


Q04947 saccharomyc 


38 


82 . 5 


11 . 


. 7 


1247 


16 


Q88QU8 


Q88qu8 pseudomonas 


39 


81.5 


11. 


. 6 


464 


5 


Q09484 


Q09484 caenorhabdi 


40 


80 


11. 


.3 


564 


10 


Q8L7Z9 


Q817z9 spinacia ol 


41 


79 


11. 


.2 


568 


16 


Q9CKM1 


Q9ckml pasteurella 


42 


79 


11. 


.2 


618 


5 


Q22003 


Q22003 caenorhabdi 


43 


79 


11, 


.2 


638 


16 


Q83F64 


Q83f64 coxiella bu 


44 


78.5 


11, 


. 1 


261 


16 


Q83KX1 


Q83kxl shigella fl 


45 


78.5 


11, 


. 1 


458 


16 


Q9X1C8 


Q9xlc8 thermotoga 



7VLIGNMENTS 



RESULT 1 
Q7T224 

ID Q7T224 PRELIMINARY 
AC Q7T224; 

DT Ol-OCT-2003 (TrEMBLrel. 
DT Ol-OCT-2003 (TrEMBLrel. 
DT Ol-OCT-2003 (TrEMBLrel, 



PRT; 199 AA. 
25, Created) 

25, Last sequence update) 
25, Last annotation update) 



DE RTN4-C. 

GN RTN4 . 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=227158 87; PubMed=12 8322 8 8 ; 

RA Oertle T., Klinger M. , Stuermer C.A. , Schwab M.E.; 

RT "A reticular rhapsody: phylogenic evolution and nomenclature of the 

RT RTN/Nogo gene family."; 

RL FASEB J. 17:1238-1247(2003). 

DR EMBL; AY164737; AAP47312.1; -. 

SQ SEQUENCE 199 AA; 22293 MW; 07CF4E4EF272 3251 CRC64; 

Query Match 65.2%; Score 460; DB 13; Length 199; 

Best Local Similarity 88.5%; Pred. No. 6.3e-39; 

Matches 92; Conservative 5; Mismatches 7; Indels 0; Gaps 0; 

Qy 38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYI7VLALLSVTIS 97 

II I M I II II II II II I II II I I II II I II II II II I II II II II I I II II II II I 
Db 1 MDSQPSGWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 

Qy 98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I II I I I II I I I I I I I I I I I I I M I : I I : I I : I : I I II : II 
Db 61 FRIYKGVIQAIQKSDEGHPFRAYLESDVAVSEDLIQKYSSWLG 104 



RESULT 2 
Q80W95 

ID Q8 0W95 PRELIMINARY; PRT; 578 AA. 

AC Q8 0W95; 

DT Ol-JUN-2003 (TrEMBLrel. 24, Created) 

DT Ol-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Nogo-A (Fragment) . 

GN NOGO-A. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Tozaki H., Hirata T.; 

RT "The partial sequence of mouse nogo-A cDNA clone#4 109 . " ; 

RL Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AB073672; BAC75974,1; 

DR GO; GO: 0005783; C : endoplasmic reticulum; lEA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1, 

DR PROSITE; PS50845; RETICULON; 1. 

FT NON_TER 1 1 

SQ SEQUENCE 578 AA; 63696 MW; 832670C171E4AC61 CRC64 ; 



Query Match 



63.4%; Score 447; DB 11; Length 578; 



Best Local Similarity 96.9%; Pred. No. 4.4e-37; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 



Qy 4 6 KDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I I I I M I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 388 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 447 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III I I I I I I I I I I I I I I II II I I I I I I I I I II M I 
Db 448 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 483 



RESULT 3 
Q8K290 

ID Q8K290 PRELIMINARY; PRT; 639 AA. 

AC Q8K290; 

DT Ol-OCT-2002 (TrEMBLrel. 22, Created) 

DT Ol-OCT-2002 (TrEMBLrel, 22, Last sequence update) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R. ; 

RL Submitted (JUN-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC032192; 7\AH32192.1; -. 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO:0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 639 AA; 70312 MW; 30 9A19DA37 603F11 CRC64; 

Query Match 63.4%; Score 447; DB 11; Length 639; 

Best Local Similarity 96.9%; Pred. No. 4.9e-37; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 46 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYI7VLALLSVTISFRIYKGVI 105 

I I I I I I I I I I I I I II I II I I I I I I I I I I I I II M I I I I I I I I I I II I I I M I I I I I I I 
Db 449 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 508 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III I I I I I M I I I I I I I I I I I M I I I M I I I I I II 
Db 509 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 544 



RESULT 4 
Q8IUA4 

ID Q8IUA4 PRELIMINARY; PRT; 986 AA. 

AC Q8IUA4; 

DT Ol-MAR-2003 (TrEMBLrel. 23, Created) 



DT 
DT 
DE 
GN 
OS 
OC 
OC 
OX 
RN 
RP 
RA 
RT 
RT 
RL 
RN 
RP 
RA 
RL 
RN 
RP 
RA 
RL 
RN 
RP 
RC 
RX 
RA 
RT 
RT 
RL 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
SQ 



(TrEMBLrel. 
(TrEMBLrel. 



23, Last sequence update) 

24, Last annotation update) 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



Ol-MAR-2003 
Ol-JUN-2003 
RNT4 (RTN4) . 
RTN4 . 

Homo sapiens (Human) . 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
NCBI_TaxID=9606; 
[1] 

SEQUENCE FROM N.A. 

Oertle T., van der Putten H., Schwab M.E.; 

"Genomic Structure and Functional Characterization of the Promoter 

Structures of Human and Mouse Nogo/Rtn-4 . " ; 

Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases. 

[2] 

SEQUENCE FROM N.A. 
Oertle T., Schwab M.E,; 

Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
[3] 

SEQUENCE FROM N.A. 
Van der Putten H. ; 

Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
[4] 

SEQUENCE FROM N.A, 
TISSUE=Testis; 

MEDLINE=22376540; PubMed=124 88097 ; 

Oertle T., van der Putten H., Schwab M.E.; 

"Genomic Structure and Functional Characterization of the Promoter 
Structures of Human and Mouse Nogo/Rtn-4."; 
J. Mol. Biol. 325:299-323(2003). 
EMBL; AY102285; AAM64244.1; 
EMBL; AY123245; 7\AM64249.1; 
EMBL; AY123246; AAM64250.1; 
EMBL; AY123247; AAM64251.1; 
EMBL; AY123248; AAM64252.1; 
EMBL; AY123249; AAM64253.1; 
EMBL; AY123250; AAM64254.1; 

GO; GO: 0005783; C : endoplasmic reticulum; TEA. 
InterPro; IPR003388; Reticulon. 
Pfam; PF02453; Reticulon; 1. 
PROSITE; PS50845; RETICULON; 1. 

SEQUENCE 986 AA; 108449 MW; 0CDE8F647036415A CRC64; 



Query Match 63.4%; 
Best Local Similarity 96.9%; 
Matches 93; Conservative 



Score 447; DB 4; Length 986; 
Pred. No. 7.9e-37; 
0; Mismatches 3; Indels 



0; Gaps 



0 



Qy 



Db 



46 KDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I I I I I I M I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
796 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 855 



Qy 

Db 



106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III I I I I I I I I I I I M I I II I M I I M II I I I I II 
856 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 8 91 



RESULT 5 



AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 
OX 
RN 
RP 
RC 
RA 
RT 
RT 
RL 
RN 
RP 
RC 
RA 
RL 
RN 
RP 
RC 
RA 
RL 
RN 
RP 
RC 
RA 
RL 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
SQ 



PRELIMINARY; 

(TrEMBLrel. 
(TrEMBLrel. 
(TrEMBLrel. 



PRT; 1046 AA. 
23, Created) 

23, Last sequence update) 
25, Last annotation update) 



Q8BGK7 
ID Q8BGK7 
Q8BGK7; 
Ol-MAR-2003 
Ol-MAR-2003 
Ol-OCT-2003 
RTN4 , 
RTN4. 

Mus musculus (Mouse) . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus , 
NCBI_TaxID=10090; 
[1] 

SEQUENCE FROM N.A. 
STRAIN=129/SvcJ7, and 129SvcJ7; 
Oertle T., van der Putten H., Schwab M,E.; 

"Genomic Structure and Functional Characterization of the Promoter 
Structures of Human and Mouse Nogo/Rtn-4 . " ; 
Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases. 
[2] 

SEQUENCE FROM N.A. 
STRAIN=129/SvcJ7, and 129SvcJ7; 
Oertle T., Schwab M.E.; 

Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 



[3] 

SEQUENCE FROM N.A. 
STRAIN=129/SvcJ7; 
Van der Putten H. ; 

Submitted (MAY-2 002) to the EMBL/GenBank/DDBJ databases. 
[4] 

SEQUENCE FROM N.A. 

STRAIN=129SvcJ7; 

Van der Putten H., Mir A.; 

Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
EMBL; AY102280; AAM73502.1; -. 
EMBL; AY102286; AAM73507.1; 
MGD; MGI: 1915835; Rtn4 . 

GO; GO: 0005783; C : endoplasmic reticulum; IDA. 
GO; GO: 0007399; P : neurogenesis ; IDA. 
InterPro; IPR003388; Reticulon. 
Pfam; PF02453; Reticulon; 1. 
PROSITE; PS50845; RETICULON; 1. 

SEQUENCE 1046 AA; 114221 MW; 8CE2E223 8ED51222 CRC64; 



Query Match 63.4%; 
Best Local Similarity 96.9%; 
Matches 93; Conservative 



Score 447; DB 11; 
Pred. No. 8.5e-37; 
0; Mismatches 3; 



Length 1046; 
Indels 0; Gaps 



0; 



Qy 

Db 



46 KDKVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 
856 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIAIALLSVTISFRIYKGVI 915 



Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNST^G 141 

III I II I I I I I II I II I I I I I II I II I I I I I II I I 
Db 916 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 951 



RESULT 6 
Q8BGiyi9 

ID Q8BGM9 PRELIMINARY; PRT; 1162 AA. 

AC Q8BGM9; 

DT Ol-MAR-2003 (TrEMBLrel. 23, Created) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE RTN4 . 

GN RTN4 . 

OS Mus musculus (Mouse) , 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN-129/SVCJ7, and 129SvcJ7; 

RA Oertle T., van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 , " ; 

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN-129/SVCJ7, and 129SvcJ7; 

RA Oertle T., Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7; 

RA Van der Putten H, ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129SvcJ7 ; 

RA Van der Putten H., Mir A.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY102284; AAM73506.1; -. 

DR EMBL; AY1022B6; AAM73511.1; -. 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 1162 AA; 126613 MW; 855697FBEE11781F CRC64; 

Query Match 63.4%; Score 447; DB 11; Length 1162; 
Best Local Similarity 96.9%; Pred. No. 9.5e-37; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 4 6 KDKVVDLLYWRDIKKTGVVFGASLFLLLSLTVFSIVSVTAYIALALLSAn'ISFRIYKGVI 105 

I I I I I { I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I 

Db 972 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1031 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

III I I I I I I I I I I I M I II I II I I I II I I I I I I II 

Db 1032 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 1067 



RESULT 7 
Q8BH78 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 



PRELIMINARY; 



PRT; 



356 AA. 



(TrEMBLrel, 
(TrEMBLrel, 
(TrEMBLrel. 



23, 
23, 
25, 



Created) 

Last sequence update) 
Last annotation update) 



Q8BH78 
Q8BH78; 
Ol-MAR-2003 
Ol-MAR-2003 
Ol-OCT-2003 
RTN4. 
RTN4. 

Mus mus cuius (Mouse) . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteieostomi ; 
OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
OX NCBI_TaxID=100 90; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and I29SvcJ7; 

RA Oertle T., van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted {OCT-2002) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T., Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
RN [3] 

RP SEQUENCE FROM N.A. 
RC STRAIN=129/SvcJ7; 
RA Van der Putten H,; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129SvcJ7; 

RA Van der Putten H., Mir A.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
DR EMBL; AY102281; AAM73503.1; -. 
DR EMBL; AY102286; AAM73508.1; -. 
DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 
DR GO; GO: 0007399; P : neurogenesis ; IDA. 
DR InterPro; IPR003388; Reticulon. 
DR Pfam; PF02453; Reticulon; 1. 
DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 356 AA; 38403 MW; 4366C03BA9630B56 CRC64; 



Query Match 62.8%; Score 443; DB 11; Length 356; 

Best Local Similarity 98.9%; Pred. No. 6.5e-37; 

Matches 92; Conservative 0; Mismatches 1; Indels 0; 



Gaps 



0; 



Qy 

Db 



49 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAI 108 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
169 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAI 228 



Qy 



Db 



109 AKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I 
229 QKSDEGHPFRAYLESEVAISEELVQKYSNSAiG 261 



RESULT 
Q8BHF5 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 



Created) 

Last sequence update) 
Last annotation update) 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; Mus . 



Q8BHF5 PRELIMINARY; PRT; 375 AA. 

Q8BHF5; 

Ol-MAR-2003 (TrEMBLrel, 23, 
Ol-MAR-2003 (TrEMBLrel. 23, 
Ol-OCT-2003 (TrEMBLrel. 25, 
RTN4. 
RTN4 . 

Mus mus cuius (Mouse) . 
OC Eukaryota; Metazoa; Chordata; 
OC Maininalia; Eutheria; Rodent ia; 
OX NCB I_TaxI D= 10090; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN-129/SVCJ7, and 129SvcJ7; 

RA Oertle T., van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T., Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
RN [3] 

RP SEQUENCE FROM N.A, 
RC STRAIN=12 9/SvcJ7; 
RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129SvcJ7; 

RA Van der Putten H, , Mir A,; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 
DR EMBL; AY102282; AAM73504.1; 
DR EMBL; AY102286; AAM73509.1; -. 
DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 
DR GO; GO: 0007399; P : neurogenesis ; IDA. 
DR InterPro; IPR003388; Reticulon. 
DR Pfam; PF02453; Reticulon; 1. 
DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 375 AA; 40300 MW; 23D9EB19BE671AE6 CRC64; 



Query Match 62.8%; Score 443; DB 11; Length 375; 

Best Local Similarity 98.9%; Pred. No. 6.9e-37; 

Matches 92; Conservative 0; Mismatches 1; Indels 0; 



Gaps 



0; 



Qy 

Db 



49 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAI 108 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I 
18 8 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAI 247 



Qy 109 AKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I 



Db 248 QKSDEGHPFRAYLESEVAISEELVQKYSNSALG 280 



RESULT 9 
Q96B16 

ID Q96B16 PRELIMINARY; PRT; 392 AA. 

AC Q96B16; 

DT Ol-DEC-2001 (TrEMBLrel. 19, Created) 

DT Ol-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT Ol-JUN-2003 (TrEMBLrel, 24, Last annotation update) 

DE Hypothetical protein (RTN4) . 

GN RTN4 . 

OS Homo sapiens (Human) , 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9 606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE-Kidney; 

RA Strausberg R. ; 

RL Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Oertle T., van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Oertle T., Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Van der Putten H. ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22376540; PubMed=124 8 8 097 ; 

RA Oertle T., van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4."; 

RL J. Mol. Biol. 325:299-323(2003). 

DR EMBL; BC016165; AAH16165.1; -. 

DR EMBL; AY102285; AAM64242.1; -. 

DR EMBL; AY102278; AAM64247.1; -. 

DR GO; GO: 0005783; C : endoplasmic reticulum; lEA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 392 AA; 42274 MW; D7B2AA5E839E58AD CRC64; 

Query Match 62.8%; Score 443; DB 4; Length 392; 

Best Local Similarity 98.9%; Pred. No. 7.3e-37; 

Matches 92; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



4 9 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAI 108 



Db 205 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAI 264 

Qy 109 AKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I I I i I I I I I I I I M I I I I I I I I I I I I I I I I I 
Db 265 QKSDEGHPFRAYLESEVAISEELVQKYSNSALG 297 



RESULT 10 
Q8K3G8 

ID Q8K3G8 PRELIMINARY; PRT; 1163 AA. 

AC Q8K3G8; 

DT 01~OCT-2002 (TrEMBLrel. 22, Created) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Nogo-A. 

GN RTN4 , 

OS Mus musculus (Mouse) . 

00 Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Jin W., Long M. , Li R. , Ju G. ; 

RT "Cloning and expression of the mouse Nogo-A protein."; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY114152; AAM77068.1; 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1, 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 1163 AA; 126691 MW; 6B5F3627 99417EA4 CRC64; 

Query Match 61.9%; Score 436.5; DB 11; Length 1163; 

Best Local Similarity 95.9%; Pred. No. l.le-35; 

Matches 93; Conservative 0; Mismatches 3; Indels 1; Gaps 1; 

Qy 46 KDKWDLLYWRDIKKTGW-FGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGV 104 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
Db 972 KTSWDLLYWRDIKKTGWYFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGV 1031 

Qy 105 IQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I I I I I I I I I I I I I I I I I I M I I I I M M I I I I I I I 
Db 1032 IQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS7VLG 1068 



RESULT 11 
Q8K3G7 

ID Q8K3G7 PRELIMINARY; PRT; 357 AA. 

AC Q8K3G7; 

DT Ol-OCT-2002 (TrEMBLrel. 22, Created) 
DT Ol-OCT-2002 (TrEMBLrel. 22, Last sequence update) 
DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 
DE Nogo-B. 



GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Jin W., Li R. , Long M. , Shen J., Ju G. ; 

RT "Cloning and expression of the mouse Nogo-B protein."; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY114153; AAM77069.1; 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 357 AA; 38566 MW; 73BB3D17DFDBDF15 CRC64; 

Query Match 61.3%; Score 432.5; DB 11; Length 357; 

Best Local Similarity 91,9%; Pred, No. 7.7e-36; 

Matches 92; Conservative 0; Mismatches 1; Indels 1; Gaps 1 

Qy 49 WDLLYWRDIKKTGW-FGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQA 107 

I I I I I I I M M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 169 WDLLYWRDIKKTGWYFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQA 228 

Qy 108 lAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 229 IQKSDEGHPFRAYLESEVAISEELVQKYSNSTMiG 262 



RESULT 12 
Q7YRW9 

ID Q7YRW9 PRELIMINARY; PRT; 184 AA. 

AC Q7YRW9; 

DT Ol-OCT-2003 (TrEMBLrel. 25, Created) 

DT Ol-OCT-2003 (TrEMBLrel, 25, Last sequence update) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE RTN4w (Fragment) . 

GN RTN4 . 

OS Bos taurus (Bovine) , 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Cetartiodactyla ; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22715887; PubMed=12832288; 

RA Oertle T., Klinger M. , Stuermer C.A., Schwab M.E.; 

RT "A reticular rhapsody: phylogenic evolution and nomenclature of the 

RT RTN/Nogo gene family."; 

RL FASEB J. 17:1238-1247(2003). 

DR EMBL; AY164744; AAP47319.1; -. 

FT NON_TER 1 1 

SQ SEQUENCE 184 AA; 20671 MW; DE990E03BBAF8 4A1 CRC64; 



Query Match 59.1%; Score 417; DB 6; Length 184; 

Best Local Similarity 96.6%; Pred. No. 1.4e-34; 

Matches 86; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 53 LYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIAKSD 112 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I III 
Db 1 LYWRDIKKTGWFGXXLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSD 60 

Qy 113 EGHPFRAYLESEVAISEELVQKYSNSALG 141 

I II I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 61 EGHPFRAYLESEVAISEELVQKYSNSALG 8 9 



RESULT 13 
Q9GM33 



ID 
AC 
DT 
DT 
DT 
DE 
OS 
OC 
OC 
OC 
OX 
RN 
RP 
RC 
RA 
RA 
RT 
RT 
RL 
DR 
DR 
DR 
DR 
DR 
KW 
SQ 



Q9GM33 
Q9GM33; 
Ol-MAR-2001 
Ol-MAR-2001 
Ol-OCT-2003 



PRELIMINARY; 



PRT; 



179 AA. 



(TrEMBLrel. 16, 
(TrEMBLrel. 16, 
(TrEMBLrel. 25, 
Hypothetical protein. 

Macaca fascicularis (Crab eating macaque) 



Created) 

Last sequence update) 
Last annotation update) 



(Cynomolgus monkey) 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Cercopithecidae; 



Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
Cercopithecinae; Macaca. 
NCBI_TaxID=9541; 
[1] 

SEQUENCE FROM N.A. 
TISSUE=Brain; 

Osada N., Hida M. , Kusuda J., Tanuma R., Iseki K., Hirai M., Terao K. 
Suzuki Y. , Sugano S., Hashimoto K. ; 

"Isolation of full-length cDNA clones from macaque brain cDNA 
libraries . " ; 

Submitted (OCT-2000) to the EMBL/GenBank/DDBJ databases. 
EMBL; A£049853; BAB16739.1; -. 

GO; GO: 0005783; C : endoplasmic reticulum; lEA. 
InterPro; IPR003388; Reticulon. 
Pfam; PF02453; Reticulon; 1. 
PROSITE; PS50845; RETICULON; 1. 
Hypothetical protein. 

SEQUENCE 179 AA; 19949 MW; 5F8CD4383FEE9E02 CRC64; 



Query Match 54 . 8%; 

Best Local Similarity 96.4%; 
Matches 81; Conservative 



Score 386; DB 6; 
Pred. No. 2e-31; 
2; Mismatches 1; 



Length 179; 
Indels 0; 



Gaps 



0; 



Qy 

Db 

Qy 

Db 



58 IKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIAKSDEGHPF 117 

: I M II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I : I I I I I I II I 11 I I I 
1 MKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYRGVIQAIQKSDEGHPF 60 

118 RAYLESEVAISEELVQKYSNSALG 141 
I M I I I II I I I I I I I I I I I I I II I 
61 RAYLESEVAISEELVQKYSNS7VLG 84 



RESULT 14 



Q7T222 

ID Q7T222 PRELIMINARY; PRT; 214 AA. 

AC Q7T222; 

DT Ol-OCT-2003 (TrEMBLrel. 25, Created) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE RTN4-M. 

GN RTN4 . 

OS Carassius auratus (Goldfish) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi ; Cyp rini formes ; 

OC Cyprinidae; Carassius, 

OX NCBI_TaxID^7957; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22715887; PubMed-12 8322 8 8 ; 

RA Oertle T., Klinger M. , Stuermer C.A., Schwab M.E.; 

RT "A reticular rhapsody: phylogenic evolution and nomenclature of the 

RT RTN/Nogo gene family."; 

RL FASEB J. 17:1238-1247(2003). 

DR EMBL; AY164754; AAP47329.1; 

SQ SEQUENCE 214 AA; 24063 MW; E9B603B043159DD9 CRC64; 

Query Match 54.5%; Score 384; DB 13; Length 214; 

Best Local Similarity 71.4%; Pred. No. 3.9e-31; 

Matches 70; Conservative 20; Mismatches 8; Indels 0; Gaps 0 

Qy 43 KHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYK 102 

I I I I I I I I I I I I I ::: I I I I I M I I I I M I : I I I : I I : I : I I I I I I I I I I I I I M 
Db 21 KHWKEQWDLLYWRDLQRTGWFGASLLLLLSLSVCSIISVISYVALALLSVTISFRIYK 80 

Qy 103 GVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSAL 140 

I : : I I : I I : : I M I : I I : : : II I I I I I I I :: II 
Db 81 GILQAVQKSEDGHPFKMYLDKDIGISSELVQKYSDTAL 118 



RESULT 15 
Q9BQ59 

ID Q9BQ59 PRELIMINARY; PRT; 199 7\A. 

AC Q9BQ59; 

DT Ol-JUN-2001 (TrEMBLrel. 17, Created) 

DT Ol-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Similar to reticulon 1. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9 606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lung; 

RA Strausberg R. ; 

RL Submitted (FEB-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; BC003003; AAH03003.1; -. 

DR EMBL; BC000314; AAH00314.1; -. 

DR GO; GO: 0005783; C : endoplasmic reticulum; lEA. 

DR GO; GO: 0000786; C : nucleosome; lEA. 



DR GO; GO: 0005634; C:nucleus; lEA. 

DR GO; GO: 0003677; FiDNA binding; lEA, 

DR GO; GO: 0007001; P: chromosome organization and biogenesis (sen. . .; lEA. 

DR GO; GO: 0006334; P:nucleosome assembly; lEA. 

DR InterPro; IPR001951; Histone_H4. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS00047; HIST0NE_H4; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 199 AA; 22642 MW; 7CFA44CC568DF6D8 CRC64; 

Query Match 49.4%; Score 348; DB 4; Length 199; 

Best Local Similarity 64,0%; Pred. No, l,7e-27; 

Matches 64; Conservative 18; Mismatches 18; Indels 0; Gaps 0; 

Qy 38 MDGQKKHWKDKVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 

II : I I : : II I I I I II I : I I : I I I : I M III I I : I I I I I : I II II Ml 

Db 1 MDCVWSNWKSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATIS 60 

Qy 98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSN 137 

I I I I I I : I I : I : I I I II I : I I I I I : : I : I : I II : : 
Db 61 FRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTD 100 



Search completed: September 29^ 2004, 18:19:49 
Job time : 12.0927 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



September 29, 2004, 18:04:48 ; Search time 2.02133 Seconds 

(without alignments) 
3632.211 Million cell updates/sec 

US-09-830-972-32 
705 

1 QASGEAGVSCLRENFAVYSV ESEVAISEELVQKYSNSALG 141 

BLOSUM62 

Gapop 10.0 , Gapext 0 . 5 



Searched: 141681 seqs, 52070155 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



141681 



Database 



SwissProt 42:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



Query 



No. 


Score 


Match 


Length 


DB 


ID 




Description 


1 


495 


70. 


,2 


199 


1 


RTN4 


_MOUSE 


Q99p72 


mus musculu 


2 


447 


63. 


,4 


1163 


1 


RTN4^ 


_RAT 


Q9jkll 


rattus norv 


3 


447 


63. 


,4 


1192 


1 


RTN4' 


_HUMAN 


Q9nqc3 


homo sapien 


4 


337 


47. 


, 8 


lie 


1 


RTNl^ 


_HUMAN 


Q16799 


homo sapien 


5 


337 


47, 


.8 


111 


1 


RTNl^ 


_RAT 


Q64548 


rattus norv 


6 


309 


43. 


, 8 


236 


1 


RTN3" 


_HUMAN 


095197 


homo sapien 


7 


308 


43. 


.7 


237 


1 


RTN3^ 


_MOUSE 


Q9es97 


mus musculu 


8 


214 


30. 


,4 


545 


1 


RTN2 


_HUMAN 


075298 


homo sapien 


9 


197 


27. 


.9 


471 


1 


RTN2 


MOUSE 


070622 


mus musculu 


10 


77.5 


11, 


.0 


243 


1 


T2RD^ 


__MOUSE 


Q9jka2 


mus musculu 


11 


77.5 


11. 


.0 


261 


1 


PHSC 


_ECOLI 


P77409 


escherichia 


12 


75.5 


10. 


.1 


246 


1 


T2R8' 


_MOUSE 


Q9jkaO 


mus musculu 


13 


75.5 


10. 


.7 


545 


1 


G6PI^ 


_HELPY 


025781 


helicobacte 


14 


74.5 


10. 


.6 


545 


1 


G6PI^ 


_HELPJ 


Q92k49 


helicobacte 


15 


74 


10. 


.5 


268 


1 


YC73^ 


_HAJEIN 


P44150 


haemophilus 


16 


72 


10. 


,2 


614 


1 


S6AC' 


RABIT 


P48055 


oryctolagus 


17 


72 


10, 


.2 


3511 


1 


MY 15 


MOUSE 


Q9qzz4 


mus musculu 



18 


71 


10 


. 1 


346 


1 


3BHS_VACCC 


P21097 


V 3 beta-hy 


19 


71 


10. 


.1 


346 


1 


3BHS VACCV 


P26670 


V 3 beta-hy 


20 


70 


9, 


. 9 


960 


1 


GBR1_M0U3E 


Q9wvl8 


mus musculu 


21 


70 


9, 


.9 


961 


1 


GBR1_HUMAN 


Q9ubs5 


homo sapien 


22 


70 


9, 


.9 


991 


1 


GBR1_RAT 


Q9z0u4 


rattus norv 


23 


69 


9. 


.8 


315 


1 


LXD1_PH0LE 


P21309 


photobacter 


24 


69 


9, 


.8 


398 


1 


PGK_STRPN 


Q97s89 


streptococc 


25 


69 


9, 


.8 


468 


1 


YDBM_CAEEL 


Q19084 


caenorhabdi 


26 


68.5 


9, 


.7 


238 


1 


T2RA_M0USE 


Q9 jka3 


mus musculu 


27 


68.5 


9, 


.7 


311 


1 


HTRB_HAEIN 


P45239 


haemophilus 


28 


68.5 


9, 


,1 


325 


1 


VP35 VARV 


P33059 


variola vir 


29 


68 


9. 


.6 


184 


1 


YDB5 SCHPO 


Q10358 


s chizosacch 


30 


68 


9. 


. 6 


1447 


1 


DCC_HUMAN 


P43146 


homo sapien 


31 


67.5 


9. 


. 6 


324 


1 


VP35_VACCC 


P20497 


vaccinia vi 


32 


67.5 


9. 


. 6 


503 


1 


LEU1_BUCUM 


Q9evh0 


buchnera ap 


33 


67.5 


9, 


.6 


877 


1 


SULH_SCHPO 


074377 


s chizosacch 


34 


67 


9. 


.5 


175 


1 


OLE2_BRANA 


P29111 


brassica na 


35 


67 


9. 


.5 


453 


1 


SYS ARCFU 


028244 


archaeoglob 


36 


67 


9. 


.5 


525 


1 


SYH CAEEL 


P34183 


caenorhabdi 


37 


67 


9. 


,5 


756 


1 


rirT HAEIN 


P43754 


haemophilus 


38 


66.5 


9, 


,4 


3174 


1 


CHAC HUMAN 


Q96rl7 


homo sapien 


39 


66 


9. 


,4 


253 


1 


ADH DROAD 


Q00669 


drosophila 


40 


66 


9, 


, 4 


537 


1 


YCUB SCHPO 


059831 

\^ -s-V \J J_ 


<^r'h"i '7osj^rT*Vi 


41 


66 


9, 


,4 


548 


1 


AMDS~EMENI 


P08158 


emericella 


42 


65,5 


9. 


.3 


182 


1 


Y696_METJA 


Q58107 


methanococc 


43 


65.5 


9. 


,3 


398 


1 


ACK1_NEIMB 


Q9jyitil 


neisseria m 


44 


65.5 


9. 


,3 


664 


1 


ntpi_enthr 


P43439 


enterococcu 


45 


65.5 


9. 


3 


880 


1 


DPOl BACSU 


034996 


bacillus su 



ALIGNMENTS 



RESULT 1 
RTN4_M0USE 

ID RTN4_M0USE STANDARD; PRT; 199 AA. 

AC Q99P72; Q9CTE3; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) . 

GN RTN4 OR NOGO. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=3T3-L1; TISSUE-Adipocyte ; 

RA Coulson A.C., Craggs P.D., Morris N.J.; 

RT "Mouse vp20/RTN4C cDNA."; 

RL Submitted (DEC-2000) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE OF 170-199 FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE^Embryo ; 

RX MEDLINE=21085660; PubMed=l 12 17 851 ; 

RA Kawai J., Shinagawa A., Shibata K. , Yoshino M. , Itoh M. , Ishii Y., 



RA Arakawa T., Kara A., Fukunishi Y., Konno H., Adachi J.^ Fukuda S., 

RA Aizawa K., Izawa M. Nishi K., Kiyosawa H., Kondo S., Yamanaka I., 

RA Saito T., Okazaki Y., Gojobori T., Bono H., Kasukawa T., Saito R,, 

RA Kadota K. , Matsuda H.A., Ashburner M, , Batalov S., Casavant T., 

RA Fleischmann W., Gaasterland T., Gissi C, King B., Kochiwa H,, 

RA Kuehl P., Lewis S., Matsuo Y.^ Nikaido I., Pesole G., Quackenbush J., 

RA Schriml L.M,, Staubli F. , Suzuki R. , Tomita M, , Wagner L., Washio T., 

RA Sakai K. , Okido T., Furuno M. , Aono H., Baldarelli R. , Barsh G., 

RA Blake J., Boffelli D., Bojunga N., Carninci P., de Bonaldo M.F., 

RA Brownstein M.J., Bult C. , Fletcher Fujita M. , Gariboldi M. , 

RA Gustincich S., Hill D., Hofmann M. , Hume D.A., Kamiya M. , Lee N.H., 

RA Lyons P., Marchionni L., Mashima J., Mazzarelli J., Mombaerts P., 

RA Nordone P., Ring B.^ Ringwald M., Rodriguez I., Sakamoto N., 

RA Sasaki H., Sato K., Schoenbach C, Seya T., Shibata Y, , Storch K,-F., 

RA Suzuki H., Toyo-oka K. , Wang K.H,, Weitz C, Whittaker Wilming L., 

RA Wynshaw-Boris A., Yoshida K. , Hasegawa Y, , Kawaji H., Kohtsuki S., 

RA Hayashizaki Y. ; 

RT "Functional annotation of a full-length mouse cDNA collection."; 

RL Nature 409:685-690(2001). 

CC FUNCTION: Potent neurite outgrowth inhibitor which may also help 

CC block the regeneration of the nervous central system in adults (By 

CC similarity) . 

CC -!- SUBUNIT: Binds to RTN4R, Interacts with Bcl-xl and Bcl-2 (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the 
CC membrane of the endoplasmic reticulum through 2 putative 

CC transmembrane domains (By similarity) , 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=l; 

CC Comment=A number of isoforms may be produced; 

CC Name^l; 

CC IsoId=Q99P72-l; Sequence=Displayed; 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF326337; AAK08076.1; -. 

DR EMBL; AK003859; -; NOT_ANNOTATED_CDS . 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO:0030176; C:integral to endoplasmic reticulum membrane; ISS. 

DR GO; GO:0005635; C: nuclear membrane; ISS. 

DR GO; GO:0005515; F:protein binding; ISS. 

DR GO; GO: 0019987; P:negative regulation of anti-apoptosis ; ISS. 

DR GO; GO: 0030517; P:negative regulation of axon extension; ISS. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane. 

FT DOMAIN 1 25 CYTOPLASMIC (Potential) . 



FT 


TRANSMEM 


26 


50 


POTENTIAL. 


FT 


DOMAIN 


51 


137 


LUMENAL (Potential). 


FT 


TRANSMEM 


138 


162 


POTENTIAL. 


FT 


DOMAIN 


163 


199 


CYTOPLASMIC (Potential) . 


FT 


DOMAIN 


12 


199 


RETICULON. 


SQ 


SEQUENCE 


199 AA; 


22466 


MW; 07BE5D580059ED9C CRC64; 



Query Match 70.2%; 
Best Local Similarity 97,1%; 
Matches 101; Conservative 



Qy 

Db 



Score 495; DB 1; Length 199; 
Pred. No. 2.6e-41; 
0; Mismatches 3; Indels 



0; Gaps 



0; 



38 MDGQKKHWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 97 
II IN I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
1 MDDQKKRWKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTIS 60 



Qy 

Db 



98 FRIYKGVIQAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I II I I I I I I I I I II II I II I I II I I I I I I I I M I I I II I I M I 
61 FRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 104 



RESULT 2 
RTN4_RAT 

ID RTN4_RAT STANDARD; PRT; 1163 AA. 

AC Q9JK11; Q9JK10; Q9R0D9; Q9WUE9; Q9WUF0; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT lO-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) (Foocen) 

DE (Glut4 vesicle 20 kDa protein) . 

GN RTN4 OR NOGO. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 3), AND PARTITUj SEQUENCE. 

RC STRAIN=Sprague-Dawley; TISSUE=Adipocyte ; 

RX MEDLINE=99249816; PubMed=10231557 ; 

RA Morris N.J., Ross S.A., Neveu J.M. , Lane W.S., Lienhard G.E.; 

RT "Cloning and characterization of a 22 kDa protein from rat adipocytes: 

RT a new member of the reticulon family."; 

RL Biochim. Biophys . Acta 1450:68-76(1999). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3) . 

RX MEDLINE=2 012 9258; PubMed=106677 96; 

RA Chen M.S., Huber A.B., Van der Haar M.E., Frank M. , Schnell L., 

RA Spillmann A.A. , Christ F. , Schwab M.E.; 

RT "Nogo-A is a myelin-associated neurite outgrowth inhibitor and an 

RT antigen for monoclonal antibody IN-1."; 

RL Nature 403:434-439(2000). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 4). 

RC STRAIN=Wistar Kyoto; TISSUE=Vascular smooth muscle; 

RA Ito T., Schwartz S.M.; 

RT "Cloning of a member of the reticulon gene family in rat: one of two 

RT minor splice variants."; 

RL Submitted (FEB-1999) to the EMBL/GenBank/DDBJ databases. 



RN [4] 

RP FUNCTION. 

RX MEDLINE-22033691; PubMed=12037567 ; 

RA GrandPre T., Li S., Strittmatter S.M.; 

RT "Nogo-66 receptor antagonist peptide promotes axonal regeneration."; 

RL Nature 417:547-551(2002). 

CC FUNCTION: Potent neurite outgrowth inhibitor which may also help 

CC block the regeneration of the nervous central system in adults (By 

CC similarity) . 

CC SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2 (By 

CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the 

CC membrane of the endoplasmic reticulum through 2 putative 

CC transmembrane domains (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=4; 

CC Name=l; Synonyms=Nogo-A, NI-220-250; 

CC IsoId=Q9JKll-l; Sequence=Di splayed; 

CC Name=2; Synonyms=Nogo-B, Foocen-Ml; 

CC IsoId=Q9JKll-2; Sequence=VSP_005658 ; 

CC Name=3; Synonyms=Nogo-C, VP20; 

CC IsoId=Q9JKll-3; Sequence=VSP_00565 6 , VSP_005657; 

CC Name=^4; Synonyms=Foocen-M2 ; 

CC IsoId=Q9JKll-4; Sequence=VSP_005659 ; 

CC -!- TISSUE SPECIFICITY: Isoforms 1, 2 and 3 are present in optic 

CC nerve, spinal cord and cerebral cortex. Isoforms 1 and 2 are 

CC present in dorsal root ganglion, sciatic nerve and PC12 cells 

CC after longer exposure. Isoforms 2 and 3 are detected in kidney, 

CC cartilage, skin, lung and spleen. Isoform 3 is expressed at high 

CC level in skeletal muscle. In adult animals isoform 1 is expressed 

CC mainly in the nervous system. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; AF051335; AAF01564.1; -. 

DR EMBL; AJ242961; CAB71027.1; -. 

DR EMBL; AJ242962; CAB71028.1; -. 

DR EMBL; AJ242963; CAB71029.1; 

DR EMBL; AF132045; AAD31019.1; -. 

DR EMBL; AF132046; AAD31020.1; 

DR GO; GO:0030176; C:integral to endoplasmic reticulum membrane; IDA. 

DR GO; GO: 0005635; C:nuclear membrane; ISS, 

DR GO; GO: 0005515; F:protein binding; ISS. 

DR GO; GO: 0019987; P:negative regulation of anti-apoptosis ; ISS. 

DR GO; GO: 0030517; P:negative regulation of axon extension; ISS. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane. 

FT DOMAIN 1 989 CYTOPLASMIC (Potential). 



FT 


TRANSMEM 


990 


1010 


POTENTIAL. 


FT 


DOMAIN 


1011 


1104 


LUMENAL (Potential) . 


FT 


TRANSMEM 


1105 


1125 


POTENTIAL. 


FT 


DOMAIN 


1126 


1163 


CYTOPLASMIC (Potential) , 


FT 


DOMAIN 


976 


1163 


RETICULON. 


FT 


DOMAIN 


33 


46 


POLY-GLU. 


FT 


DOMAIN 


73 


76 


POLY-ALA. 


FT 


DOMAIN 


140 


145 


POLY- PRO. 


FT 


VARSPLIC 


1 


964 


Missing (in isoform 3) . 


FT 








/FTId=VSP_005656. 


FT 


VARSPLIC 


965 


975 


AVLSAELSKTS -> MDGQKKHWKDK (in isoform 


FT 








3) . 


FT 








/FTId=VSP_005657. 


FT 


VARSPLIC 


173 


975 


Missing (in isoform 2) . 


FT 








/FTId=VSP_005658 . 


FT 


VTVRSPLIC 


192 


975 


Missing (in isoform 4) . 


FT 








/FTId=VSP_005659. 


FT 


CONFLICT 


1130 


1131 


MISSING (IN REF, 3; AAD31020) . 


SQ 


SEQUENCE 


1163 


AA; 1263E 


i6 MW; 8CB894B09E94F0B6 CRC64; 



Query Match 63.4%; Score 447; DB 1; Length 1163; 

Best Local Similarity 96.9%; Pred. No. 8.2e-36; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 46 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I I I I I N I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I 
Db 973 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1032 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNSALG 141 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1033 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 1068 



RESULT 3 
RTN4_HUMAN 

ID RTN4_HUMAN STANDARD; PRT; 1192 AA. 

AC Q9NQC3; 094962; Q9BXG5; Q9H212; Q9H3I3; Q9UQ42; Q9Y293; Q9Y2Y7; 

AC Q9Y5U6; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel, 41, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) (Foocen) 

DE (Neuroendocrine-specific protein) (NSP) (Neuroendocrine specific 

DE protein C homolog) (RTN-x) (Reticulon 5) {My043 protein) . 

GN RTN4 OR NOGO OR ASY OR KIAA0886. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3). 

RX MEDLINE=20129242; PubMed=106677 80 ; 

RA Prinjha R. , Moore S.E., Vinson M. , Blake S., Morrow R. , Christie G., 

RA Michalovich D., Simmons D.L., Walsh F.S.; 

RT "Inhibitor of neurite outgrowth in humans."; 

RL Nature 403:383-384(2000). 

RN [2] 



RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2) . 

RC TISSUE=Brain; 

RX MEDLINE=21010696; PubMed=11126360 ; 

RA Tagami S., Eguchi Y,, Kinoshita M. , Takeda M. Tsujimoto Y. ; 

RT "A novel protein, RTN-XS, interacts with both Bcl-XL and Bcl-2 on 

RT endoplasmic reticulum and reduces their anti-apoptotic activity."; 

RL Oncogene 19:5736-5746(2000). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3). 

RX MEDLINE-20237542; PubMed=1077368 0 ; 

RA Yang J., Yu L., Bi A.D., Zhao S.-Y,; 

RT "Assignment of the human reticulon 4 gene (RTN4} to chromosome 

RT 2pl4 — >2pl3 by radiation hybrid mapping."; 

RL Cytogenet. Cell Genet. 88:101-102(2000). 
RN [4] 

RP SEQUENCE FROM N.A. (ISOFORM 4) . 

RA Jin W.-L., Ju G.; 

RT "Developmentally-regulated alternative splicing in a novel Nogo-A."; 

RL Submitted (NOV-2000) to the EMBL/GenBank/DDBJ databases. 
RN [5] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3) . 

RC TISSUE=Placenta, and Skeletal muscle; 

RA Ito T., Schwartz S.M.; 

RT "Cloning of a member of the reticulon gene family in human."; 

RL Submitted (FEB-1999) to the EMBL/GenBank/DDBJ databases. 

RN [6] 

RP SEQUENCE FROM N.A. (ISOFORM 2) . 

RC TISSUE-Fibroblast; 

RA Yutsudo M. ; 

RT "Isolation of a cell death-inducing gene."; 

RL Submitted (JUN-1998) to the EMBL/GenBank/DDBJ databases. 

RN [7] 

RP SEQUENCE FROM N.A. (ISOFORM 3). 

RC TISSUE=Pituitary; 

RA Song H., Peng Y., Zhou J., Huang Q., Dai M. , Mao Y.M., Yu Y., Xu X., 

RA Luo B., Hu R., Chen J.; 

RT "Human neuroendocrine-specif ic protein C (NSP) homolog gene."; 

RL Submitted (JUL-1998) to the EMBL/GenBank/DDBJ databases. 

RN [8] 

RP SEQUENCE FROM N.A. (ISOFORM 3) . 

RA Gu J.R., Wan D.F., Zhao X.T., Zhou X.M., Jiang H.Q., Zhang P.P., 

RA Qin W.X., Huang Y., Qiu X.K., Qian L.F., He L.P., Li H.N., Yu Y., 

RA Yu J. , Han L.H. ; 

RT "Novel human cDNA clone with function of inhibiting cancer cell 

RT growth."; 

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases. 

RN [9] 

RP SEQUENCE FROM N.A. (ISOFORM 1) . 

RC TISSUE=Brain; 

RX MEDLINE=99156230; PubMed=100484 85 ; 

RA Nagase T., Ishikawa K.-I., Suyama M. , Kikuno R., Hirosawa M. , 

RA Miyajima N., Tanaka A., Kotani H., Nomura N. , Ohara O. ; 

RT "Prediction of the coding sequences of unidentified human genes. XII. 

RT The complete sequences of 100 new cDNA clones from brain which code 

RT for large proteins in vitro."; 

RL DNA Res. 5:355-364(1998). 

RN [10] 



RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3) . 

RC TISSUE=Brain, Ovary, Pancreas, Placenta, and Skeletal muscle; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H,, Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K., Farmer A. A., Rubin G.M. , Hong L., 

RA Stapleton M. , Scares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A,, Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P. J., McKernan K.J., Malek J.A, , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A,, Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J,, Schmutz J,, Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U,, Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [11] 

RP SEQUENCE FROM N.A. (ISOFORM 3) . 

RX MEDLlNE-20499367; PubMed=ll 042 152 ; 

RA Zhang Q.-H., Ye M. , Wu X.-Y., Ren S.-X., Zhao M. , Zhao C.-J., Fu G. , 

RA Shen Y. , Fan H.-Y., Lu G., Zhong M. , Xu X.-R., Han Z.-G., Zhang J.-W., 

RA Tao J., Huang Q.-H., Zhou J., Hu G.-X., Gu J., Chen S.-J., Chen Z.; 

RT "Cloning and functional analysis of cDNAs with open reading frames for 

RT 300 previously undefined genes expressed in CD34+ hematopoietic 

RT stem/progenitor cells."; 

RL Genome Res. 10:1546-1560(2000). 

RN [12] 

RP SEQUENCE OF 482-1192 FROM N.A. (ISOFORM 1/4) . 

RC TISSUE=Brain; 

RA Mao Y.M., Xie Y., Zheng Z.H.; 

RL Submitted (MAY-1998) to the EMBL/GenBank/DDBJ databases. 

RN [13] 

RP SEQUENCE OF 186-1192 FROM N.A. (ISOFORM 1) . 

RC TISSUE=Testis; 

RA Sha J.H., Zhou Z.M., Li J.M. ; 

RL Submitted (JAN-2001) to the EMBL/GenBank/DDBJ databases. 

RN [14] 

RP TOPOLOGY. 

RC TISSUE=Brain; 

RX MEDLINE=20129259; PubMed=l 06677 97 ; 

RA GrandPre T., Nakamura F. , Vartanian T., Strittmatter S.M.; 

RT "Identification of the Nogo inhibitor of axon regeneration as a 

RT Reticulon protein."; 

RL Nature 403:439-444(2000). 

RN [15] 

RP FUNCTION. 

RC TISSUE=Brain; 

RX MEDLINE=21069055; PubMed=112 017 42 ; 

RA Fournier A.E., Grandpre T., Strittmatter S.M.; 

RT "Identification of a receptor mediating Nogo-66 inhibition of axonal 



RT regeneration."; 

RL Nature 4 09:341-34 6(2 001). 

RN [16] 

RP REVIEW. 

RX MEDLINE=21888956; PubMed=11891768 ; 

RA Ng C.E.L., Tang B.L.; 

RT "Nogos and the Nogo-66 receptor: factors inhibiting CNS neuron 

RT regeneration . " ; 

RL J. Neurosci. Res. 67:559-565(2002), 

CC FUNCTION: Potent neurite outgrowth inhibitor which may also help 

CC block the regeneration of the nervous central system in adults . 

Isoform 2 reduces the anti-apoptotic activity of Bcl-xl and Bcl-2. 
This is likely consecutive to their change in subcellular 
location, from the mitochondria to the endoplasmic reticulum, 

CC after binding and sequestration. 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum. Anchored to the membrane of the endoplasmic reticulum 

through 2 putative transmembrane domains. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=4; 

CC Name=l; Synonyms=RTN 4 A, Nogo-A, RTN-xL; 

CC IsoId=Q9NQC3-l ; Sequence=Displayed; 

CC Name=2; Synonyms^RTN 4B, Nogo-B, RTN-xS, Foocen-M; 

CC IsoId=Q9NQC3-2; Sequence=VSP_005655 ; 

CC Name=^3; Synonyms=RTN 4C, Nogo-C, Foocen-S; 

CC IsoId-Q9NQC3-3; Sequence=VSP_005652 , VSP_005653; 

CC Name=4; 

CC IsoId=Q9NQC3-4 ; Sequence=VSP_005654 ; 

CC -!- TISSUE SPECIFICITY: Isoform 1 is specifically expressed in brain 

CC and testis and weakly in heart and skeletal muscle. Isoform 2 is 

CC widely expressed excepted for the liver. Isoform 3 is expressed in 

CC brain, skeletal muscle and adipocytes. Isoform 4 is testis- 

CC specific, 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC -!- CAUTION: Ref.ll sequence differs from that shown due to 

CC frameshifts in positions 1149 and 1156. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AJ251383; CAB99248.1; -. 

DR EMBL; AJ251384; CAB99249.1; -. 

DR EMBL; AJ251385; CAB99250.1; -. 

DR EMBL; AB040462; BAB18927.1; -. 

DR EMBL; AB040463; BAB18928.1; -, 

DR EMBL; AF148537; AAG12176.1; -. 

DR EMBL; AF148538; AAG12177.1; -. 

DR EMBL; AF087901; AAG12205.1; -. 

DR EMBL; AF320999; AAG40878.1; -. 

DR EMBL; AF132047; AAD31021.1; 

DR EMBL; AF132048; AAD31022.1; -. 



DR EMBL; AB015639; BAA83712.1; - 

DR EMBL; AF077050; AAD27783.1; - 

DR EMBL; AF177332; AAG17976.1; - 

DR EMBL; AB020693; BAA74909.1; - 

DR EMBL; BC001035; 7^01035.1; - 

DR EMBL; BC007109; AAH07109.1; - 

DR EMBL; BC014366; AAH14366.1; - 



Query Match 63.4%; Score 447; DB 1; Length 1192; 

Best Local Similarity 96.9%; Pred. No. 8,4e-36; 

Matches 93; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 46 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I I N I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 
Db 1002 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1061 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSNS7VLG 141 

III I I M II II I I I I I I I I I I I I I I I M II I I I I I 
Db 1062 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALG 1097 



RESULT 4 
RTN1_HUMAN 

ID RTN1_HUMAN STANDARD; PRT ; 776 AA. 

AC Q16799; Q16800; Q16801; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT lO-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 1 (Neuroendocrine-specif ic protein) . 

GN RTNl OR NSP. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTNl-A; RTNl-B AND RTNl-C) . 

RC TISSUE=Lung carcinoma; 

RX MEDLINE=93293865; PubMed=7 685762 ; 

RA Roebroek A.J.M., Van de Velde H.J.K., Van Bokhoven A., Broers J.L.V., 

RA Ramaekers F.C.S., Van de Ven W.J.M.; 

RT "Cloning and expression of alternative transcripts of a novel 

RT neuroendocrine-specific gene and identification of its 135-kDa 

RT translational product."; 

RL J. Biol. Chem. 268:13439-13447(1993). 

RN [2] 

RP ALTERNATIVE SPLICING. 

RX MEDLINE-96429995; PubMed=8 83314 5 ; 

RA Roebroek A.J.M., Ayoubi T.A.Y., Van de Velde H.J.K., 

RA Schoenmakers E.F.P.M., Pauli I.G.L., Van de Ven W.J.M.; 

RT "Genomic organization of the human NSP gene, prototype of a novel gene 

RT family encoding reticulons . " ; 

RL Genomics 32:191-199(1996). 

RN [3] 

RP TISSUE SPECIFICITY. 

RX MEDLINE=98228245; PubMed=95604 66; 

RA Hens J., Nuydens R. , Geerts H., Senden N.H., Van de Ven W.J.M., 

RA Roebroek A. J., van de Velde H.J.K., Ramaekers F.C., Broers J.l!; 



RT "Neuronal differentiation is accompanied by NSP-C expression."; 

RL Cell Tissue Res. 292:229-237(1998). 

CC -!- FUNCTION: May be involved in neuroendocrine secretion or in 

CC membrane trafficking in neuroendocrine cells. 

CC SUBCELLULAR LOCATION: Endoplasmic reticulum membrane. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=3; 

CC Name=RTNl-A; Synonyms=NSP-A; 

CC IsoId=Ql6799-l; Sequence==Displayed; 

CC Name=:RTNl-B; Synonyms=NSP-B; 

CC IsoId=Q16799-2; Sequence=VSP_005644 ; 

CC Name=RTNl-C; Synonyms=NSP-C; 

CC IsoId=Q16799-3; Sequence=VSP_005645, VSP_005646; 

CC -!- TISSUE SPECIFICITY: EXPRESSED IN NEURAL AND NEUROENDOCRINE TISSUES 
CC AND CELL CULTURES DERIVED THEREFROM. EXPRESSION OF ISOFORM RTNl-C 

CC IS STRONGLY CORRELATED WITH NEURONAL DIFFERENTIATION. 

CC PTM: Isoforms RTNl-A and RTNl-B are phosphorylated . 

CC SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L10333; AAA59950.1; -. 

DR EMBL; L10334; AAA59951.1; -. 

DR EMBL; L10335; AAA59952.1; -. 

DR PIR; A46583; A46583. 

DR PIR; 160904; 160904. 

DR Genew; HGNC:10467; RTNl. 

DR MIM; 600865; -. 

GO; GO:0030176; C:integral to endoplasmic reticulum membrane; TAS , 
GO; GO: 0004871; F: signal transducer activity; NAS . 

DR GO; GO: 0030182; P:neuron differentiation; TAS. 

DR GO; GO: 0007165; P: signal transduction; NAS. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 



DR 
DR 



DR 


PROSITE; 


PS50845; 


RETICULON; 


1. 


KW 


Endoplasmic reticulum; Alternative splicing; Transmembrane; 


KW 


Phosphorylation . 






FT 


TRANSMEM 


603 


623 


POTENTIAL. 


FT 


TRANSMEM 


726 


746 


POTENTIAL. 


FT 


DOMAIN 


589 


776 


RETICULON. 


FT 


DOMAIN 


609 


612 


POLY-LEU. 


FT 


VARSPLIC 


1 


420 


Missing (in isoform RTNl-B) . 


FT 








/FTId=VSP_005644. 


FT 


VARSPLIC 


1 


568 


Missing (in isoform RTNl-C) . 


FT 








/FTId=VSP_005645. 


FT 


VARSPLIC 


569 


588 


GPGPLGPGAPPPLLFLNKQK -> MQATADSTKMDCVWSNW 


FT 








KSQ (in isoform RTNl-C) . 


FT 








/FTId=VSP_005646. 


SQ 


SEQUENCE 


776 AA; 


83617 MW 


; CA5B6232353096FE CRC64; 



Query Match 



47.8%; Score 337; DB 1; Length 776; 



Best Local Similarity 67.4%; Pred. No. 3e-25; 

Matches 62; Conservative 16; Mismatches 14; Indels 0; Gaps 0 

Qy 4 6 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSA/TISFRIYKGVI 105 

I I I II III Ihlll Ihlll II IIIMIM h 

586 KQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYIJUiAALSATISFRIYKSV^ 645 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSN 137 

II: I : I I I II I : I I I I I : : I : I : I I I : : 
Db 646 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTD 677 



RESULT 5 
RTNl RAT 



ID RTN1_RAT STANDARD; PRT; 777 AA. 

AC Q64548; Q64547; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT lO-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 1 (Neuroendocrine-speci f ic protein) (S-rex) . 

GN RTNl OR NSP. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus, 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTNl-B AND RTNl-S). 

RC STRAIN=Wistar; TISSUE=Brain cortex; 

RX MEDLINE=96386034; PubMed=8793864 ; 

RA Baka I.D., Ninkina N.N., Pinon L.G.P., Adu J., Davies A.M., 

RA Georgiev G.P., Buchman V.L.; 

RT "Intracellular compartmentalization of two differentially spliced s- 

RT rex/NSP mRNAs in neurons."; 

RL Mol. Cell. Neurosci. 7:289-303(1996). 

CC -!- FUNCTION: May be involved in neuroendocrine secretion or in 

CC membrane trafficking in neuroendocrine cells. 

CC SUBCELLULAR LOCATION: Endoplasmic reticulum membrane (By 

CC similarity) . 

CC ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=RTNl-B; Synonyms=S-RexB; 

CC IsoId=Q64548-l; Sequence-Displayed; 

CC Name=RTNl-S; Synonyms=S-RexS ; 

CC IsoId=Q64548-2; Sequence=VSP_005647 , VSP_005648; 

CC TISSUE SPECIFICITY: EXPRESSED PREDOMINANTLY IN CENTRAL AND 

CC PERIPHERAL NERVOUS SYSTEM OF NEWBORN AND ADULT RATS. LOW LEVELS 

CC HAVE BEEN ALSO DETECTED IN HEART, ADRENAL GLAND AND SPLEEN. 

CC EXPRESSION OF ISOFORM RTNl-B IS RESTRICTED TO PARTICULAR NEURONAL 

CC TYPES. 

CC DEVELOPMENTAL STAGE: DETECTED ON EMBRYONIC DAY ElO IN THE 

CC HINDBRAIN AND IN Ell IN THE FOREBRAIN. DURING THE NEXT 3 EMBRYONIC 

CC DAYS THE LEVELS OF S-REXS INCREASES AND REMAINS STABLE AT E13 IN 

CC THE HINDBRAIN AND AT E14 IN THE FOREBRAIN. THE LEVELS OF S-REXB 

CC DOES NOT CHANGE AS SIGNIFICANTLY DURING DEVELOPMENT OF THE 

CC HINDBRAIN. 

CC SIMILARITY: Contains 1 reticulon domain. 

CC 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U17604; AAC53046.1; 

DR EMBL; U17603; AAC53045.1; 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 



DR 


PROSITE; 


PS50845; 


RETICULON; 


1. 


KW 


Endoplasmic reticulum; Alternative splicing; Transmembrane. 


FT 


TRANSMEM 


604 


624 


POTENTIAL, 


FT 


TRANSMEM 


727 


747 


POTENTIAL. 


FT 


DOMAIN 


590 


111 


RETICULON. 


FT 


DOMAIN 


610 


613 


POLY-LEU. 


FT 


VT^SPLIC 


1 


569 


Missing (in isoform RTNl-S) . 


FT 








/FTId=VSP_005647. 


FT 


VARSPLIC 


570 


589 


IPGPLGSDLVPPLPFFNKQK -> MQATADSTKMDCVWSNW 


FT 








KSQ (in isoform RTNl-S) . 


FT 








/FTId=VSP_005648. 


SQ 


SEQUENCE 


777 AA; 


83001 MW; 


AF7479C50F28D0AC CRC64; 



Query Match 47,8%; Score 337; DB 1; Length 777; 

Best Local Similarity 67.4%; Pred. No. 3e-25; 

Matches 62; Conservative 16; Mismatches 14; Indels 0; Gaps 0; 

Qy 46 KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 105 

I I -111111111:11:111: | | | | | | | | : | | | | | : | | | | { | | | | | | | | | : 
^87 KQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 646 

Qy 106 QAIAKSDEGHPFRAYLESEVAISEELVQKYSN 137 

II: I : I I I I I I : M I I I : : I : I : | | | : : 
Db 647 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTD 678 



RESULT 6 




RTN3 


_HUMAN 




ID 


RTN3 HUMAN STANDARD; PRT; 236 AA, 




AC 


095197; 




DT 


16-OCT-2001 (Rel. 40, Created) 




DT 


16-OCT-2001 (Rel. 40, Last sequence update) 




DT 


lO-OCT-2003 (Rel, 42, Last annotation update) 




DE 


Reticulon protein 3 (Neuroendocrine-specif ic protein 


-like 2) (NSP-like 


DE 


protein II) (NSPLII) . 


GN 


RTN3 OR NSPL2. 




OS 


Homo sapiens (Human) . 




OC 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 


Euteleostomi; 


oc 


Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 


Homo , 


OX 


NCBI TaxID=9606; 




RN 


[1] 




RP 


SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 




RC 


TISSUE=Retina; 




RX 


MEDLINE=99265974; PubMed=10331947 ; 




RA 


Moreira E.F., Jaworski C.J., Rodriguez I.R.; 





RT "Cloning of a novel member of the reticulon gene family {RTN3) : gene 

RT structure and chromosomal localization to llql3."; 

RL ■ Genomics 58:73-81(1999). 
RN [2] 

RP SEQUENCE FROM N.A. 

RA Huang X., Zhou Y., Du G,, Yuan J., Qiang B . ; 

RT "Cloning and expression analysis of a cDNA encoding a novel 

RT neuroendocrine-specific protein-like protein 1: NSPLl."; 

RL Submitted (JAN-1999) to the EMBL/GenBank/DDBJ databases! 
RN [3] 

RP SEQUENCE FROM N.A, 

RC TISSUE-Brain, Eye, and Lymph; 

RX MEDLINE=22388257; PubMed=12 4 7 7 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G,, 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K,, 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K. , Farmer A.A. , Rubin G.M., Hong L., 

RA Stapleton M., Scares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E. 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K. J. , Malek J. A., Gunaratne P.H,, 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X,, Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R,M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E,, Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum (Potential) . 

CC TISSUE SPECIFICITY: WIDELY EXPRESSED WITH HIGHEST EXPRESSION IN 

CC BRAIN. THREE TIMES MORE ABUNDANT IN MACULA THAN IN PERIPHERAL 

CC RETINA. 

CC SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to licenseliisb-sib . ch) . 

CC 

DR EMBL; AF059524; AAC99319.1; 

DR EMBL; AF059529; 7^20951.1; -. 

DR EMBL; AF059525; AAD20951.1; JOINED. 

DR EMBL; AF059526; AAD20951.1; JOINED. 

DR EMBL; AF059527; AAD20951.1; JOINED. 

DR EMBL; AF059528; AAD20951.1; JOINED. 

DR EMBL; AF119297; AAD26810.1; 

DR EMBL; BC000634; AAH00634.1; -. 

DR EMBL; BC010556; AAH10556.1; -. 



DR EMBL; BC011394; AAH11394.1; 

DR EMBL; BC022993; AAH22993.1; -. 

DR Genew; HGNC: 104 69; RTN3 . 

DR MIM; 604249; -. 

DR GO; GO: 0005615; C : extracellular space; TAS . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM 68 88 POTENTIAL. 

FT TR7\NSMEM 177 197 POTENTIAL. 

FT DOMAIN 4 8 236 RETICULON. 

SQ SEQUENCE 236 AA; 25609 MW; DDC6A4 54 4ABCDFB7 CRC64; 

Query Match 43.8%; Score 309; DB 1; Length 236; 

Best Local Similarity 60.9%; Pred. No. 4.7e-23; 

Matches 56; Conservative 18; Mismatches 18; Indels 0; Gaps 0 

Qy 49 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAI 108 

I ll::MI:|lll IN :i :|||| ||::|| :|: | | | | | | | | | | | | | | ||||: 
48 VHDLIFWRDVKKTGFVFGTTLIMLLSLAAFSVISWSYLILALLSVTISFRIYKSVIQAV 107 

Qy 109 AKSDEGHPFRAYLESEVAISEELVQKYSNSAL 140 

I I : M I I I : I I I : :: : | | | | : | : 

Db 108 QKSEEGHPFKAYLDVDITLSSEAFHNYMNAAM 139 



RESULT 7 
RTN3_M0USE 

ID RTN3_M0USE STANDARD; PRT; 237 AA. 

AC Q9ES97; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Reticulon protein 3. 

GN RTN3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Huang X., Zhou Y., Qiang H., Yuan J., Qiang B.; 

RT "Cloning and expression profile of a novel mouse cDNA encoding a human 

RT RTN3 homolog. "; 

RL Submitted (OCT-1999) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Eye; 

RX MEDLINE=22388257; PubMed=12477 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K. , Farmer A.A. , Rubin G.M. , Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 



RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P. J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W,, 

RA Villalon D.K,, Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M, , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C, 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum (Potential). 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF195940; AAG31360.1; 

DR EMBL; BC014697; AAH14697.1; 

DR MGD; MGI: 1339970; Rtn3. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM 69 89 POTENTIAL. 

FT TRANSMEM 167 187 POTENTIAL. 

FT DOMAIN 4 9 237 RETICULON. 

SQ SEQUENCE 237 AA; 25428 MW; EB6OA0A7AC45F0DE CRC64; 

Query Match 43.7%; Score 308; DB 1; Length 237; 

Best Local Similarity 59.8%; Pred. No. 6e-23; 

Matches 55; Conservative 19; Mismatches 18; Indels 0; Gaps 0; 

Qy 4 9 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAI 108 

I I I I I I : I I I I III : I : II M II : : II : I : I I I I I I I II I I : I I MM: 
4 9 VHDLIFWRDVKKTGFVFGTTLIMLLSLAAFSVISWSYLILALLSVTISFRVYKSVIQAV 108 

Qy 109 AKSDEGHPFRAYLESEVAISEELVQKYSNSAL 140 

I M M II M I I I: : : : | | | \:\: 

Db 109 QKSEEGHPFKAYLDVDITLSSEAFHNYMNAAM 140 



RESULT 8 
RTN2_HUMAN 

ID RTN2_HUMAN STANDARD; PRT; 545 AA. 

AC 075298; 060509; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT lO-OCT-2003 (Rel. 42, Last annotation update) 



DE Reticulon protein 2 (Neuroendocrine-specif ic protein-like 1) (NSP-like 

DE protein 1) (NSPLI). 

GN RTN2 OR NSPLI. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
OX NCBI_TaxID=9606; 
RN [1] 

RP SEQUENCE FROM N.A., ALTERNATIVE SPLICING, AND TISSUE SPECIFICITY 

RC TISSUE=Lung carcinoma; 

RX MEDLINE=98360096; PubMed=9693037 ; 

RA Roebroek A.J.M., Contreras B., Pauli I.G.L., Van de Ven W.J.M.; 

RT "cDNA cloning, genomic organization, and expression of the human RTN2 

RT gene, a member of a gene family encoding reticulons . " ; 

RL Genomics 51:98-106(1998). 

RN [2] 

RP SEQUENCE OF 108-545 FROM N.A, (ISOFORM RTN2-B) . 

RC TISSUE=Brain; 

RX MEDLINE=98191726; PubMed=9530 622 ; 

RA Geisler J.G., Stubbs L.J., Wasserman W.W., Mucenski M.L.; 

RT "Molecular cloning of a novel mouse gene with predominant muscle and 

RT neural expression,"; 

RL Mamm. Genome 9:274-282(1998). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 
CC reticulum (Potential) . 

CC ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=RTN2-A; 

Isold=075298-1; Sequence=Displayed; 

Note=Isoform RTN2-C is produced by alternative initiation at 
GC Met-341 of isoform RTN2-A; 

CC Name=RTN2-B; 

IsoId=075298-2; Sequence=VSP_005649 ; 
CC Event=Alternative initiation; 

CC Comment=2 isoforms, RTN2-A (shown here) and RTN2-C, are produced 

CG by alternative initiation at Met-1 and Met-341; 

CC -!- TISSUE SPECIFICITY: ISOFORM RTN2-C IS HIGHLY EXPRESSED IN SKELETAL 
CC MUSCLE. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib , ch) . 

CC 

DR EMBL; AF004222; AAC32542.1; -. 

DR EMBL; AF004223; AAC32543.1; -. 

DR EMBL; AF004224; AAC32544.1; -. 

DR EMBL; AF038540; 7\AC14910.1; 

DR Genew; HGNC: 104 68; RTN2 . 

DR MIM; 603183; -. 

DR GO; GO: 0030176; C:integral to endoplasmic reticulum membrane; NAS. 

DR GO; GO: 0004871; F:signal transducer activity; NAS. 

DR GO; GO: 0007165; P:signal transduction; NAS. 



DR 


InterPro; 


IPR0033£ 


J8; Reticulon, 


DR 


Pfam; PF02453; Reticulon; 


1. 




DR 


PROSITE; 


PS50845; 


RETICULON; 


1. 


KW 


Endoplasmic reticulum; Alternative splicing; Transmembrane; 


PCW 


Alternative initiation. 






FT 


CHAIN 


1 


545 




RETICULON PROTEIN 2, ISOFORM RTN2 


FT 


CHAIN 


341 


545 




RETICULON PROTEIN 2, ISOFORM RTN2 


FT 


INIT_MET 


341 


341 




FOR ISOFORM RTN2-C. 


FT 


TRANSMEM 


368 


388 




POTENTIAL. 


r J. 


TRANSMEM 


463 


483 




POTENTIAL. 


FT 


DOMAIN 


345 


545 




RETICULON. 


FT 


VARSPLIC 


272 


344 




Missing (in isoform RTN2-B) . 


FT 










/FTId=VSP 005649. 


SQ 


SEQUENCE 


545 AA; 


59263 


MW; 


971FD2F909E1E9E6 CRC64; 


Query Match 




30.4^ 




Score 214; DB 1; Length 545; 



Best Local Similarity 46.7%; Pred. No. 2.2e-13; 



Matches 



42; Conservative 21; Mismatches 27; Indels 



0; Gaps 



Qy 

Db 

Qy 

Db 



48 KVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQA 107 
I I I M M : I : : I I I I : III II II II I : : || || I I I | : | : | : | | 
344 KVADLLYWKDTRTSGWFTGLMVSLLCLLHFSIVSVAAHLALLLLCGTISLRVYRKVLQA 4 03 

108 lAKSDEGHPFRAYLESEVAISEELVQKYSN 137 

: : I : I I : II I : : : : : I : : | : 
4 04 VHRGDGANPFQAYLDVDLTLTREQTERLSH 433 



RESULT 9 




RTN2 


_MOUSE 




ID 


RTN2 MOUSE STANDARD; PRT; 471 AA. 




AC 


070622; 070620; 




DT 


16-OCT-2001 (Rel. 40, Created) 




DT 


16-OCT-2001 (Rel. 40, Last sequence update) 




DT 


15-MAR-2004 (Rel. 43, Last annotation update) 




DE 


Reticulon protein 2 (Neuroendocrine-specif ic protein-like 1) 


(NSP-like 


DE 


protein 1) (NSPLI) . 


GN 


RTN2 OR NSPLI. 




OS 


Mus musculus (Mouse) . 




OC 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 


OC 


Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 


Mus , 


OX 


NCBI TaxID=10090; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. (ISOFORMS 1 AND 2), AND TISSUE SPECIFICITY 




RC 


STRAIN=FVB/N, and 129/Sv; TISSUE=Cerebellum, and Skeletal muscle; 


RX 


MEDLINE=9819172 6; PubMed-9530 622 ; 




RA 


Geisler J.G., Stubbs L.J., Wasserman W.W., Mucenski M.L.; 




RT 


"Molecular cloning of a novel mouse gene with predominant muscle and 


RT 


neural expression. "; 




RL 


Mamm. Genome 9:274-282(1998). 




RN 


[2] 




RP 


SEQUENCE FROM N.A. (ISOFORM 1) . 




RC 


TISSUE=Retina; 




RX 


MEDLINE=22388257; PubMed=12477932 ; 




RA 


Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G,, 




RA 


Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler 


G.D. , 


RA 


Altschul S.F,, Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 



RA Hopkins R.F., Jordan H., Moore T., Max S,I,, Wang J., Hsieh F., 

RA Diatchenko L., Marusina K., Farmer A. A., Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci p., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abrarnson R.D,, Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J,, Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S,, Sanchez A., 

RA Whiting M. , Madan A., Young A.C, Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Membrane-bound. Endoplasmic reticulum 

CC (Potential) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l; Synonyms=Brain; 

CC IsoId=O70622-l; Sequence=Displayed; 

CC Name-2; Synonyms=Muscle; 

CC IsoId=O70622-2; Sequence=VSP_005650, VSP_005651; 

CC -!- TISSUE SPECIFICITY: Expressed predominantly in neural and muscular 
CC tissues. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute, There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http : //www. isb-sib , ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF038537; AAC14906.1; 

DR EMBL; AF038537; AAC14907,1; -. 

DR EMBL; AF038538; AAC14908.1; 

DR EMBL; AF038539; AAC14909.1; -. 

DR EMBL; AF093624; AAD13195.1; -. 

DR EMBL; BC031370; AAH31370.1; -. 

DR MGD; MGI: 107612; Rtn2 , 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane. 

FT TRANSMEM 295 315 POTENTIAL. 

FT DOMAIN 272 471 RETICULON. 

FT VARSPLIC 1 267 Missing ( in isof orm 2 ) . 

/FTId=VSP_005650 . 

FT VARSPLIC 268 271 PLLL -> MGSK (in isoform 2). 

/FTId=VSP_005651 . 

SQ SEQUENCE 471 AA; 51346 MW; 9BBD8F372CF63AD3 CRC64; 



Query Match 



27.9%; Score 197; DB 1; Length 471; 



Best Local Similarity 44.3%; Pred. No. 8.6e-12; 

Matches 39; Conservative 20; Mismatches 29; Indels 0; Gaps 0 

Qy 49 WDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYITVLALLSVTISFRIYKGVIQAI 108 

I I I I I I : I :: I I I : || | |||||| |::|| | ||| |:|: |:||: 
272 VADLLYWKDTRTSGAVFTGLMASLLCLLHFSIVSVAAHLALLGLCATISLRVYRKVLQAV 331 

Qy 109 Ty^SDEGHPFRAYLESEVAISEELVQKYS 136 

: I : I I : I I I : : : : : I : : | 
Db 332 HRGDGTNPFQAYLDMDLTLTREQTERLS 359 



RESULT 10 
T2RD MOUSE 



ID 

AC 

DT 

DT 

DT 

DE 

DE 

GN 

OS 

OC 

OC 

OX 

RN 

RP 

RC 

RX 

RA 

RT 

RL 

RN 

RP 

RX 

RA 

RT 

RL 

RN 

RP 

RX 

RA 

RT 

RL 

RN 

RP 

RX 

RA 

RA 

RT 

RT 

RL 

CC 

CC 

CC 

CC 



T2RD_M0USE 
Q9JKA2; 
lO-OCT-2003 
lO-OCT-2003 
lO-OCT-2003 



STANDARD; 



PRT; 



243 AA. 



Created) 

Last sequence update) 
Last annotation update) 

member 13 (T2R13) (Taste receptor family B 



Rodentia; 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi; Muridae; Murinae; Mus , 



in human and mouse."; 



{Rel. 42, 
(Rel. 42, 
(Rel. 42, 
Taste receptor type 2 
member 3) (TRB3) (Fragment) . 
TAS2R13. 

Mus musculus (Mouse) . 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; 
NCBI_TaxID=10090; 
[1] 

SEQUENCE FROM N.A, 
STRAIN=DBA/2J; 

MEDLINE=2 0227309; PubMed=l 0766242; 
Matsunami H., Montmayeur J. -P., Buck L 
"A family of candidate taste receptors 
Nature 404:601-604(2000). 
[2] 

REVIEW. 

MEDLINE=22135574; PubMed=12139982 ; 
Montmayeur J. -P., Matsunami H.; 
"Receptors for bitter and sweet taste."; 
Curr. Opin. Neurobiol. 12:366-371(2002). 
[3] 

REVIEW. 

MEDLINE=2 1634 924; PubMed=11696554 ; 
Margolskee R. F. ; 

"Molecular mechanisms of bitter and sweet taste transduction."; 
J. Biol. Chem. 277:1-4(2002). 
[4] 

REVIEW. 

MEDLINE=22469025; PubMed=12 58 152 0 ; 

Zhang Y., Hoon M.A. , Chandrashekar J., Mueller K.L., Cook B., Wu 

Zuker C.S., Ryba N.J.; 

"Coding of sweet, bitter, and umami tastes: different receptor cells 

sharing similar signaling pathways."; 

Cell 112:293-301(2003). 

FUNCTION: Receptor that may play a role in the perception of 
bitterness and is gustducin-linked . May play a role in sensing the 
chemical composition of the gastrointestinal content. The activity 
of this receptor may stimulate alpha gustducin, mediate PLC-beta-2 



D., 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



activation and lead to the gating of TRPM5 . 
-!- SUBCELLULAR LOCATION: Integral membrane protein. 

TISSUE SPECIFICITY: Expressed in subsets of taste receptor cells 
of the tongue and palate epithelium and exclusively in gustducin- 
positive cells. 

MISCELLANEOUS: Most taste cells may be activated by a limited 
number of bitter compounds; individual taste cells can 
discriminate among bitter stimuli, 

SIMILARITY: Belongs to family T2R of G-protein coupled receptors. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib.ch). 



EMBL; AF2 

MGD; MGI: 

InterPro; 

Pfam; PFO 

Receptor; 

NON_TER 

DOMAIN 

TRANSMEM 

DOMAIN 

TRANSMEM 

DOMAIN 

TRANSMEM 

DOMAIN 

TRANSMEM 

DOMAIN 

TRANSMEM 

DOMAIN 

TRANSMEM 

CARBOHYD 

NON_TER 

SEQUENCE 



47733; AAF64510.1; 
1890148; Tas2rl3. 

IPR007960; TAS2R. 
5296; TAS2R; 1. 

G-protein coupled 



1 

<1 

13 

34 

55 

76 
100 
121 
151 
172 
196 
217 
223 
128 
243 
243 AA; 



1 

12 

33 

54 

75 

99 
120 
150 
171 
195 
216 
222 
>243 
128 
243 

28110 MW 



receptor; Transmembrane. 

CYTOPLASMIC (POTENTIAL) . 

2 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

3 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

4 (POTENTIAL) . 
EXTRACELLULAR (POTENTIT^i) . 

5 (POTENTIAL) , 
CYTOPLASMIC (POTENTIAL) . 

6 (POTENTIAL). 
EXTRACELLULAR (POTENTIAL) . 

7 (POTENTIAL). 

N-LINKED (GLCNAC. . .) (POTENTITVL) 
; D8AD14AF95B9E0B2 CRC64; 



Query Match 11.0%; Score 77.5; DB 1; 

Best Local Similarity 27.1%; Pred. No. 2; 
Matches 32; Conservative 18; Mismatches 47; 



Length 243; 
Indels 21; 



Gaps 



5; 



Qy 

Db 

Qy 

Db 



17 VYSVSVGMHNLLLLEGRSWQEMDGQKKH— WKDKWDLLYWRDIKKTGWFGASLFLLL 73 

= li =1^: M II : : I: I | |:|| | 

37 LYSALMTTRKVLIIFNNSWTVIN HFNIWLATCLSIFYFLKIAN FSNSIFLSL 8 8 

74 SLTVFSIVSVTAYIALALLSV TI SFRI YKGVIQAIAKSDEG-HPFRAYL 121 

I ^'IIM ::| II I 11 :|| : || | | :| 

89 RWRVKTWSVTLMMSLLLLFVNVLVINTFIVISVDVYKVNTSYSSHSDNNLHISRIFL 146 



RESULT 11 
PHSC_ECOLI 

ID PHSC_ECOLI STANDARD; PRT; 261 AA. 

AC P77409; 



DT Ol-NOV-1997 (Rel. 35, Created) 

DT Ol-NOV-1997 (Rel. 35, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE PhsC protein homolog. 

GN YDHU OR B167 0. 

OS Escherichia coli . 

OC Bacteria; Proteobacteria; Gammaproteobacteria ; Enterobacteriales ; 

OC Enterobacteriaceae; Escherichia . 

OX NCBI_TaxID=562; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=K12 / MG1655; 

RX MEDLINE-97426617; PubMed=9278503; 

RA Blattner F.R., Plunkett G. Ill, Bloch C.A., Perna N.T., Burland V., 

RA Riley M. , Collado-Vides J., Glasner J.D., Rode C.K., Mayhew G.F., 

RA Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A. , Rose D.J. 

RA Mau B. , Shao Y. ; 

RT "The complete genome sequence of Escherichia coli K-12."; 

RL Science 277:1453-1474(1997). 

RK [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=K12 / MG1655; 

RX MEDLINE=97175536; PubMed=9023 1 91 ; 

RA Hensel M. , Shea J.E., Baeumler A. J., Gleeson C, Blattner F R 

RA Holden D.W. ; ' ' ' 

RT "T^alysis of the boundaries of Salmonella pathogenicity island 2 and 

RT the corresponding chromosomal region of Escherichia coli K-12,"; 

RL J. Bacteriol. 179:1105-1111(1997). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=K12; 

RX MEDLINE=97251357; PubMed=9097 03 9 ; 

RA Aiba H., Baba T., Fujita K. , Hayashi K. , Inada T., Isono K. , 

RA Itoh T., Kasai H., Kashimoto K. , Kimura S., Kitakawa M. , 

RA Kitagawa M. , Makino K., Miki T., Mizobuchi K., Mori H. , Mori T., 

RA Motomura K., Nakade S., Nakamura Y., Nashimoto H., Nishio Y., 

RA Oshima T., Saito N., Sampei G., Seki Y. , Sivasundaram S., 

RA Tagami H., Takeda J., Takemoto K., Takeuchi Y., Wada C, 

RA Yamamoto Y. , Horiuchi T.; 

RT "A 570-kb DNA sequence of the Escherichia coli K-12 genome 

RT corresponding to the 28.0-40.1 min region on the linkage map."; 

RL DNA Res. 3:363-377(1996). 

CC SUBCELLULAR LOCATION: Integral membrane protein. Inner membrane 

CC (Potential) . 

CC -!- SIMILARITY: TO S . TYPHIMURIUM PHSC. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license(aisb-sib . ch) . 

CC 

DR EMBL; AE000262; AAC74740.1; -. 

DR EMBL; U68703; AAB47946.1; -, 

DR EMBL; D90810; BAA15442.1; -. 



DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
SQ 



PIR; F64924; F64924. 
EcoGene; EG13955; ydhU. 
InterPro; IPR000516; Ni_hydr_CytB . 
Pfam; PF012 92; Ni_hydr_CYTB; 1. 

Transmembrane; Inner membrane; Complete proteome. 



TRANSMEM 
TRANSMEM 
TRANSMEM 
TRANSMEM 
TRANSMEM 
SEQUENCE 



25 
81 
108 
182 
224 



45 
101 
128 
202 
244 



POTENTIAL. 
POTENTIAL. 
POTENTIAL. 
POTENTIAL. 
POTENTIAL. 



261 AA; 29583 MW; 65CF1A45691A0AF3 CRC64; 



Qy 

Db 

Qy 

Db 



Query Match 11.0%; Score 77.5; DB 1; Length 261; 

Best Local Similarity 27.6%; Pred. No. 2.2; 

Matches 29; Conservative 16; Mismatches 43; Indels 17; Gaps ^ 

22 VGMHNLLLLEGRSWQEMD-GQKKHWKDKWDLLYWRDIKKTGWFGASLFLLL SLT 76 

= 1 = 111 I I I : I : I I I : I : I I I : I I : 

44 LGLHALLRARGVKKSATDHGEKIYLYSKAVRLWHWSN ALLFVLLLASGLIN 94 

77 VFSIVSVTAYIALALLSVTISFRI — YKGVIQAIAKSDEGHPFR 118 

M :l : I : : I : I I |{ :| 

95 HFAMVGATAVKSLVAVHEVCGFLLLACWLGFVLINAVGDNGHHYR 139 



RESULT 12 
T2R8_MOUSE 

ID T2R8_MOUSE STANDARD; PRT; 246 AA. 

AC Q9JKA0; 

DT lO-OCT-2003 (Rel. 42, Created) 

DT lO-OCT-2003 (Rel. 42, Last sequence update) 

DT lO-OCT-2003 (Rel. 42, Last annotation update) 

DE Taste receptor type 2 member 8 (T2R8) (Taste receptor family B member 

DE 5) (TRB5) (Fragment). 

GN TAS2R8. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI__TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; 

RX MEDLINE=20227309; PubMed=10766242 ; 

RA Matsunami H., Montmayeur J. -P., Buck L.B.; 

RT "A family of candidate taste receptors in human and mouse."; 

RL Nature 4 04:601-604(2 000). 

RN [2] 

RP REVIEW. 

RX MEDLINE=22135574; PubMed=12 139982 ; 

RA Montmayeur J. -P., Matsunami H.; 

RT "Receptors for bitter and sweet taste."; 

RL Curr. Opin. Neurobiol. 12:366-371(2002). 

RN [3] 

RP REVIEW. 

RX MEDLINE=21634924 ; PubMed=11696554 ; 

RA Margolskee R.F.; 

RT "Molecular mechanisms of bitter and sweet taste transduction."; 



RL J. Biol. Chem. 277:1-4(2002), 

RN [4] 

RP REVIEW. 

RX MEDLINE=22469025; PubMed=1258152 0 ; 

RA Zhang Y., Hoon M.A., Chandrashekar J., Mueller K.L,, Cook B., Wu D., 

RA Zuker C.S., Ryba N.J.; 

RT "Coding of sweet, bitter, and umami tastes: different receptor cells 

RT sharing similar signaling pathways."; 

RL Cell 112:293-301(2003). 

CC -!- FUNCTION: Receptor that may play a role in the perception of 

CC bitterness and is gustducin-linked. May play a role in sensing the 

CC chemical composition of the gastrointestinal content. The activity 
of this receptor may stimulate alpha gustducin, mediate PLC-beta-2 

CC activation and lead to the gating of TRPM5 . 

CC SUBCELLULAR LOCATION: Integral membrane protein, 

CC -!- TISSUE SPECIFICITY: Expressed in subsets of taste receptor cells 

of the tongue and palate epithelium and exclusively in gustducin- 

CC positive cells. Expressed in 15% taste bud cells in circumvallate 

CC and foliate papillae but only in 2% in fungiform papillae. 

CC -!- MISCELLANEOUS: Most taste cells may be activated by a limited 

CC number of bitter compounds; individual taste cells can 

CC discriminate among bitter stimuli. 

CC -!- SIMILARITY: Belongs to family T2R of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http : //www, isb-sib . ch/announce/ 

CC or send an email to license@isb-sib , ch) . 

CC 

DR EMBL; AF247735; AAF64512.1; 

DR MGD; MGI: 1890259; Tas2r8. 

DR InterPro; IPR007960; TAS2R. 

DR Pfam; PF05296; TAS2R; 1. 

KW Receptor; G-protein coupled receptor; Transmembrane. 

FT NON_TER 1 1 

FT DOMAIN 1 15 CYTOPLASMIC (POTENTIAL) . 

FT TRANSMEM 16 36 2 (POTENTIAL). 

FT DOMAIN 37 59 EXTRACELLULAR (POTENTIAL) . 

FT TRANSMEM 60 80 3 (POTENTIAL). 

FT DOMAIN 81 102 CYTOPLASMIC (POTENTIAL). 

FT TRANSMEM 103 123 4 (POTENTIAL). 

FT DOMAIN 124 153 EXTRACELLULAR (POTENTIAL) . 

FT TRANSMEM 154 174 5 (POTENTIAL) . 

FT DOMAIN 175 198 CYTOPLASMIC (POTENTIAL) . 

FT TRANSMEM 199 219 6 (POTENTIAL). 

FT DOMAIN 22 0 225 EXTRACELLUL7VR (POTENTIAL) . 

FT TRANSMEM 226 246 7 (POTENTIAL). 

FT NON_TER 246 24 6 

SQ SEQUENCE 246 AA; 28430 MW; 8B8F96F8A62E44 74 CRC64; 

Query Match 10.7%; Score 75.5; DB 1; Length 246; 
Best Local Similarity 27,1%; Pred. No. 3.3; 

Matches 32; Conservative 18; Mismatches 47; Indels 21; Gaps 5; 



Qy 

Db 

Qy 
Db 



17 VYSVSVGMHNLLLLEGRSWQEMDGQKKH-— WKDKWDLLYWRDIKKTGWFGASLFLLL 73 

•II : :|:: II :: I | : : |: | I |:|l I 

40 LYSALMTTRKVLIIFNNSWTVIN HFNIWLATCLSI FYFLMIAN FSNSIFLSL 91 

74 SLTVFSIVSVTAYIALALLSV TISFRIYKGVIQAIAKSDEG-HPFRAYL 121 

I - II II I II I 11:11 : I I I I : I 

92 RWRVKTWSVTLLMSLLLLFVNVLVINTFIVISVDVYKVNTSYSSHSDNNIHISRIFL 149 



RESULT 13 
G6PI HELPY 



ID 

AC 

DT 

DT 

DT 

DE 

DE 

GN 

OS 

OC 

00 

OX 

RN 

RP 

RC 

RX 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RT 

RT 

RL 

CO 

CO 

CO 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
DR 



G6PI_HELPY 
025781; 
15-JUL-1998 
15-JUL-1998 
28-FEB-2003 



STANDARD; 



PRT; 



545 AA. 



( Phosphoglucose 



(Rel. 36, Created) 
(Rel. 36, Last sequence update) 
(Rel. 41, Last annotation update) 
Glucose-6-phosphate isomerase (EC 5.3.1.9) (GPI) 
isomerase) (PGI) ( Phosphohexose isomerase) (PHI) . 
PGI OR HP1166. 

Helicobacter pylori (Campylobacter pylori) . 

Bacteria; Proteobacteria; Epsilonproteobacteria; Campylobacterales ; 
Helicobacteraceae; Helicobacter . 
NCBI_TaxID=210; 
[1] 

SEQUENCE FROM N.A. 
STRAIN=26695 / ATCC 700392; 
MEDLINE=97394 4 67; PubMed=9252185 ; 

Tomb J.-F., White O., Kerlavage A.R., Clayton R.A. , Sutton G.G., 
Fleischmann R.D., Ketchum K.A., Klenk H.-P., Gill S., Dougherty B.A., 
Nelson K. , Quackenbush J., Zhou L., Kirkness E.F., Peterson S., 
Loftus B., Richardson D., Dodson R., Khalak H.G., Glodek A., 
McKenney K. , FitzGerald L.M., Lee N., Adams M.D., Hickey E.K., 
Berg D.E., Gocayne J.D., Utterback T.R., Peterson J.D., Kelley J.M., 
Cotton M.D., Weidman J.M. , Fujii C, Bowman C, Watthey L., Wallin E., 
Hayes W.S., Borodovsky M. , Karp P.D., Smith H.O,, Eraser CM,, 
Venter J. C. ; 

"The complete genome sequence of the gastric pathogen Helicobacter 
pylori . " ; 

Nature 38 8:539-547(1997). 
-!- CATALYTIC ACTIVITY: 
phosphate. 

PATHWAY: Involved in glycolysis and in gluconeogenesis . 
SUBCELLULAR LOCATION: Cytoplasmic (By similarity) . 
SIMILARITY: Belongs to the GPI family. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; AE000622; AAD08211.1; -. 
PIR; F64665; F64665. 
HSSP; Q9N1E2; IHOX. 
TIGR; HP1166; 



D-glucose 6-phosphate = D-fructose 6- 



DR HAMAP; MF_00473; -; 1. 

DR InterPro; IPR001672; G6P_Isomerase . 

DR Pfam; PF00342; PGI; 1. 

DR PRINTS; PR00662; G6PIS0MERASE . 

DR PROSITE; PS00765; P_GLUCOSE_I S0MERASE_1 ; 1. 

DR PROSITE; PS00174; P_GLUCOSE_I S0MERASE_2 ; 1. 

KW Isomerase; Gluconeogenesis ; Glycolysis; Complete proteome. 

FT ACT_SITE 382 382 BY SIMILARITY. 

FT ACT_SITE 510 510 BY SIMILARITY. 

SQ SEQUENCE 545 AA; 62487 MW; BDC68D16251 902 3 6 CRC64; 

Query Match 10.7%; Score 75.5; DB 1; Length 545; 

Best Local Similarity 25.3%; Pred. No, 7,5; 

Matches 37; Conservative 20; Mismatches 44; Indels 45; Gaps 6 

23 GMHNLL LLEGRSWQEMDGQ KKHWKDKWDLLYWRDIKKTGWFGASL 69 

I I • I : : I : I : : I I : | I I : | | : | | | | 

411 GHHEILFSNVLAQAQAFMKGKSYEEALGELLFKGLDKDEAKDLAHHR VFFGNRP 464 

Qy 70 FLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIAKSD 112 

:l I I :: I :|| I : :||| | | 

465 SNILLLEKISPSNIGALVALYEHKVFV QGVIWDINSFDQWGVELGKELAVPILQE 519 

Qy 113 -EGHPFRAYLESEVAISEELVQKYSN 137 

Ml II :| :: I:: I I 

Db 520 LEGHKSNAYFDSS TKHLIELYKN 542 



RESULT 14 
G6PI_HELPJ 

ID G6PI_HELPJ STANDARD; PRT; 545 AA. 

AC Q9ZK49; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) ■ 

DE Glucose-6-phosphate isomerase (EC 5.3.1.9) (GPI) ( Phosphoglucose 

DE isomerase) (PGI) ( Phosphohexose isomerase) (PHI). 

GN PGI OR JHP1093. 

OS Helicobacter pylori J99 (Campylobacter pylori J99) . 

OC Bacteria; Proteobacteria ; Epsilonproteobacteria; Campylobacterales ; 

OC Helicobacteraceae; Helicobacter. 

OX NCBI_TaxID-85963; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE-99120557; PubMed=992 3682 ; 

RA Aim R.A., Ling L.-S.L., Moir D.T., King B.L., Brown E.D., Doig P.O., 

RA Smith D.R., Noonan B., Guild B.C., deJonge B.L., Carmel G., 

RA Tummino P.J., Caruso A., Uria-Nickelsen M. , Mills D.M., Ives C, 

RA Gibson R., Merberg D., Mills S.D., Jiang Q., Taylor D.E., Vovis G.F., 

RA Trust T. J. ; 

RT "Genomic sequence comparison of two unrelated isolates of the human 

RT gastric pathogen Helicobacter pylori."; 

RL Nature 397:176-180(1999). 

CC CATALYTIC ACTIVITY: D-glucose 6-phosphate - D-fructose 6- 

CC phosphate. 

CC PATHWAY: Involved in glycolysis and in gluconeogenesis. 

CC SUBCELLULAR LOCATION: Cytoplasmic (By similarity). 



CC -!- SIMILARITY: Belongs to the GPI family, 
CC 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AE001536; AAD06664.1; 

DR FIR; E71851; E71851. 

DR HSSP; Q9N1E2; IHOX. 

DR HAMAP; MF_00473; 1. 

DR InterPro; IPR001672; G6P_Isomerase . 

DR Pfam; PF00342; PGI; 1. 

DR PRINTS; PR00662; G6PIS0MERASE . 

DR PROSITE; PS00765; P_GLUC0SE_IS0MERASE_1 ; 1. 

DR PROSITE; PS00174; P_GLUC0SE_IS0MERASE_2 ; 1. 

KW Isomerase; Gluconeogenesis ; Glycolysis; Complete proteome. 

FT ACT_SITE 382 382 BY SIMILARITY. 

FT ACT_SITE 510 510 BY SIMILARITY. 

SQ SEQUENCE 545 AA; 62302 MW; 7DB544D95FD1D237 CRC64; 

Query Match 10.6%; Score 74.5; DB 1 ; Length 545; 

Best Local Similarity 25.3%; Pred. No. 9.4; 

Matches 37; Conservative 19; Mismatches 45; Indels 45; Gaps 6; 

Qy 23 GMHNLL LLEGRSWQEMDGQ KKHWKDKWDLLYWRDIKKTGWFGASL 69 

I I :| ::|:|::| I: | ||: || : | | || 

Db 411 GHHEILFSNVLAQAQAFMKGKSYEEALGELLSKGLDKDET^DLAHHR VFFGNRP 4 64 

Qy 70 FLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIAKSD 112 

: I I I : : I : I I | : : I I | | | 

Db 465 SNILLLEKISPSNIGALVALYEHKVFV QGVIWDINSFDQWGVELGKELAVPILQE 519 



Qy 113 -EGHPFRAYLESEVAISEELVQKYSN 137 

III II : I : I : : I I 

Db 520 LEGHKSNAYFDSS TRHLIELYKN 542 



RESULT 15 
YC73_HAEIN 

ID YC73_HAEIN STANDARD; PRT; 268 AA. 

AC P44150; 

DT Ol-NOV-1995 (Rel. 32, Created) 

DT Ol-NOV-1995 (Rel. 32, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Hypothetical protein HI1273. 

GN HI1273. 

OS Haemophilus influenzae. 

OC Bacteria; Proteobacteria ; Gammaproteobacteria ; Pas teurellales ; 

OC Pasteurellaceae; Haemophilus. 

OX NCBI_TaxID=727; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Rd / KW20 / ATCC 51907; 



RX MEDLINE=95350630; PubMed=7 5428 00; 

RA Fleischmann R.D., Adams M.D,, White O,, Clayton R.A. , Kirkness E,F., 

RA Kerlavage A.R., Bult C.J., Tomb J.-F., Dougherty B.A., Merrick J.M. , 

RA McKenney K. , Sutton G., Fitzhugh W. , Fields C.A., Gocayne J.D., 

RA Scott J.D., Shirley R. , Liu L.-I., Glodek A., Kelley J.M., 

RA Weidman J.F., Phillips C.A., Spriggs T., Hedblom E,, Cotton M.D., 

RA Utterback T.R., Hanna M.C, Nguyen D.T., Saudek D.M., Brandon R.C, 

RA Fine L.D., Fritchman J.L., Fuhrmann J.L., Geoghagen N.S.M., 

RA Gnehm C.L., McDonald L.A. , Small K.V., Eraser CM., Smith H.O., 

RA Venter J.C. ; 

RT "Whole-genome random sequencing and assembly of Haemophilus influenzae 

RT Rd . " ; 

RL Science 269:496-512(1995). 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license(3isb-sib , ch) . 

CC 

DR EMBL; U32807; AAC22921.1; -. 

DR PIR; F64024; F64024, 

DR TIGR; HI1273; 

DR InterPro; IPR000051; SAM_bind. 

KW Hypothetical protein; Complete proteome, 

SQ SEQUENCE 268 AA; 30510 MW; E5B28DA7AADC4D0B CRC64; 

Query Match 10.5%; Score 74; DB 1; Length 268; 

Best Local Similarity 25.2%; Pred. No. 5; 

Matches 29; Conservative 15; Mismatches 49; Indels 22; Gaps 4; 

Qy 8 VSCLRENFAVYSVSVGMHNLLLLEGRSWQEMDGQKKHWKDKWDLLYWRDIKKTGWFGA 67 

: I I I : I : I : I III j | : : || : 

Db 94 LDCL AQFKQKFGLHHLTTFH KSWADN WDDVPQADWLAS 132 

Qy 68 SLFLLLSL-TVFSIVSVTAYIALALLSVTISFRIYKGVIQAIAKSDEGHPFRAYL 121 

I : I : : I : I I I I : : II : I I : I I I | | 

Db 133 RSTLVDDLDDMIEKLCAKAKKRVFLTSVTQRHFLDEGVFEAIGREDIGFPTYIYL 187 



Search completed: September 29, 2004, 18:14:57 
Job time : 12.0213 sees 



